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(57) Abstract 

The present invention relates to the isolation, purification, characterization and use of the plant diacylglycerol acyitransferase (DGAT) 
gene and genetic products. For example, the invention provides DGAT cDNA [SEQ ID NO:l] (pDGATcDNA; ATCC No PTA-989) 
and a plant diacylglycerol acyitransferase gene [SEQ ID NO:3) (pDGATgene; ATCC No PTA-988) from the Brassicaceae (specifically 
Arabidopsis thaliana). The invention includes isolated and purified DGAT DNA, preferably of the stated sequences and homologues, and 
relates to methods of regulating seed oil content, die ratio of diacylglyceroVtriacylglycero] proportions in the seed oil, fatty acid synthesis, 
seed oil acyl composition, seed size/weight and carbon flux into other seed components, using the gene, and to tissues and plants transformed 
with the gene. The invention also relates to transgenic plants, plant tissues and plant seeds having a genome containing an introduced DNA 
sequence of the invention, and a method of producing such plants and plant seeds. The invention also relates to [SEQ ID NO:1] containing 
an 81 bp insertion [SEQ ID N023], and uses thereof to modify oil content, acyl composition of triacylglycerols, seed size or carbon flux 
into other seed components. 
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DIACYLGLYCEROL ACYLTRANSFERASE GENE FROM PLANTS 


TECHNICAL FIELD 

This invention relates to plant genes useful for the genetic manipulation of 
plant characteristics. More specifically, the invention relates to the identification, 
5 isolation and introduction of diacylgtycerol acyltransferase (DGAT) genes useful, for 
example, for altering the seed oil content, the ratio of diacylglycerol / triacylglycerol 
proportions in the seed oil. fatty acid synthesis, seed oil acyl composition, seed 
size/weight and carbon flux into other seed components, in commercial or crop 
plants. 

10 BACKGROUND ART 

Plant seed oils are major sources of essential polyunsaturated fatty acids for 
human diets and renewable feedstocks for chemical industries. The enzymes of the 
fatty add synthase complex in the plastids of developing seeds are responsible for 
the biosynthesis of fatty acids that are channeled into the cytosolic acyl-CoA pool to 
15 sustain triacylglycerol accumulation. Triacylglycerol (TAG) biosynthesis is located in 
the endoplasmic reticulum with glycerol-3-phosphate and fatty acyl-CoAs as the 
primary substrates. There are three acyltransferases involved in the plant storage 
lipid bioassembly, namely the giycerol-3-phosphate acyltransferase (GPAT, EC 
2.3.1.15), the /yso-phosphatidic acid acyltransferase (LPAT, EC 2.3.1.51) and the 
20 diacylglycerol acyltransferase (DGAT, EC 2.3.1 .20). These three acyltransferases 
catalyze the stepwise acylation of the glycerol backbone with the final step being the 
acylation of sn-1 , 2-diacylglycerol (DAG) by DGAT into the formation of TAGs, a 
biochemical process generally known as the Kennedy pathway (Stymne and 
Stobart, 1987). 

25 Among the three ER-based fatty acyl-CoA acyltransferases, only LPAT 

gene(s) have been cloned from plants (Knutzon et a/., 1995, Lassner et a/., 1995). 
Like several other enzymes involved in storage lipid biosynthesis, acyltransferases 
are intrinsic ER proteins and are extremely difficult to purify. The research on plant 
DGAT has been largely limited to studies of activity profiles by using the particulate 

30 fractions generated by differential centrifugation of seed or microspore-derived 
embryo homogenates (Weselake et a/., 1993). Although partial purification of 
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DGAT from cotyledons of germinating soybean seeds was reported (Kwanyuan and 
Wilson. 1986), detailed molecular characterization of this enzyme is lacking. 

Accordingly, while the Kennedy pathway is known and shows the steps in 
the biosynthesis of TAGs in plants, there has not been any identification and use of 
5 a genetic element that can be used reliably in plants to modify TAG synthesis and 
composition in a way that may be exploited commercially. 

DISCLOSURE OF THE INVENTION 

An object of the invention is to identify, isolate and clone a genetic element 
that may be used to modify the natural formation of triacytglycerofs in plants in order 
10 to increase the yield of commercial plant oils, or to modify their composition to 
achieve specific commercial improvements of plants and plant products. 

Another object of the invention is to identify, isolate and characterize 
diacyigiycerol acyltransferase (DGAT) gene and cDNA sequences from Arabidopsis 
and to utilize these sequences in the genetic manipulation of plants. 

15 Another object of the invention is to provide a vector containing the full- 

length DGAT cDNA sequence from Arabidopsis in a sense orientation under the 
control of a seed-specific promoter (e.g. napin; See Josefsson et al., 1987; Radke 
et al M 1988; Voelker et al., 1992), for re-introducing into Arabidopsis or for 
introducing into other plants. 

20 Another object of the invention is to provide a vector containing a genomic 

fragment from Arabidopsis consisting of the fulHength DGAT gene under the control 
of its own 5* upstream regulatory sequences, for re-introducing into Arabidopsis or 
for introducing into other plants. 

Another object of the invention is to provide a method to construct a vector 
25 containing the full-length DGAT sequence or a significant portion of the DGAT 
sequence from Arabidopsis, in an antisense orientation under control of either a 
constitutive or a seed-specific promoter, for re-introducing into Arabidopsis or for 
introducing into other plants. 
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Another object of the invention is to provide a method of modifiying 
Arabidopsis and other plants to change their seed oil content. 

Another object of the invention is to provide a method of modifiying 
Arabidopsis and other plants to change the acyl composition of their seed oil. 

5 Another object of the invention is to provide a method of modifiying 

Arabidopsis and other plants to change their average seed weight or size. 

According to one aspect of the present invention, there is provided a vector 
containing isolated and purified deoxyribonucleic acid (cDNA) of SEQ ID NO:1 
(pDGATcDNA; ATCC No PTA-989), for introduction of the cDNA in a sense 
10 orientation into a plant cell. 

As another aspect of the present invention, there is provided a vector 
containing isolated and purified genomic deoxyribonucleic acid (genomic DNA) of 
SEQ ID NO:3 (pDGATgene; ATCC No PTA-988), for introduction of the gene in a 
sense orientation into a plant cell. 

15 According to yet another object of the invention, there is provided a method 

for preparing a vector containing SEQ ID NO:1 or a part thereof, for introduction of 
the gene or partial gene in an antisense orientation, into a plant cell. 

According to yet another object of the invention, there is provided seed of 
Arabidopsis thaliana ecotype Columbia mutant AS11 (ATCC No. PTA-1013) and 

20 characterization of its lipid phenotype (Katavic et ah, 1995; Zou et al. 1999). The 
AS11 mutant seed line has an insertion mutation at the TAG1 locus on 
chromosome II, and produces plants exhibiting reduced DGAT activity (Figure 4) 
and an reduced TAG/DAG ratio during seed development (Table 1), resulting in an 
altered seed fatty acyl composition (Figure 2), reduced oil content (Table 1), and 

25 increased seed oil diacylglycerol content during development (Figure 3) and at 

maturity (lower TAG/DAG ratio cf Table 1). The cDNA sequence of the AS1 1 DGAT 
is shown in SEQ ID NO:23, the genomic DNA sequence is shown in SEQ ID NO:24 
and the translated protein sequence of the AS1 1 DGAT is shown in SEQ ID NO:25. 


SUBSTITUTE SHEET (RULE 26) 
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The invention also relates to transgenic plants and plant seeds having a 
genome containing an introduced DNA sequence of SEQ ID NO:1 or SEQ ID NO:3, 
and a method of producing such plants and plant seeds. 

The invention also relates to SEQ ID NO:1 or SEQ ID NO:3, or a part of 
5 SEQ ID NO:t or SEQ ID NO:3, or SEQ ID NO:1 containing an 81 bp insertion [SEQ 
ID NO: 23] or SEQ ID NO:3 containing an 147 bp insertion [SEQ ID NOr24J such 
that the deduced amino acid sequence of the encoded protein contains the 
repeated sequence SHAGLFNLCVWUAVNSRLS I ENLM K [SEQ ID NO:25] where 
the spacing and identity of the underlined amino acids are identical or are replaced 
10 by conserved substitutions, characterized in that said sequence has been 

introduced in sense or antisense orientation, and a method of producing such plants 
and plant seeds. 

As will be appreciated by persons skilled in the art, the invention also relates 
to substantially homologous DNA sequences from plants encoding proteins with 

1 5 deduced amino acid sequences of 25% or greater identity, and 40% or greater 
similarity, isolated and/or characterized and/or designed by known methods using 
the sequence information of SEQ ID NO:1 or SEQ ID NO:3 or or SEQ ID NO:1 
containing an 81 bp insertion [SEQ ID NO:23] such that the deduced amino acid 
sequence of the encoded protein contains the repeated sequence 

20 SHAGLFNLCVWLIAVNSRLIIENLMK (SEQ ID NO:25) where the spacing and 
identity of the underlined amino acids are identical or are replaced by conserved 
substitutions, and to parts of reduced length that are still able to function as 
inhibitors of gene expression by use in an anti-sense, co-suppression (Transwitch; 
Jorgensen and Napoli 1994) or other gene silencing technologies. It will be 

25 appreciated by persons skilled in the art that small changes in the identities of 
nucleotides in a specific gene sequence may result in reduced or enhanced 
effectiveness of the genes and that, in some applications (e.g. anti-sense or co- 
suppression), partiat sequences often work as effectively as full length versions. 
The ways in which the gene sequence can be varied or shortened are well known to 

30 persons skilled in the art, as are ways of testing the effectiveness of the altered 
genes. All such variations of the genes are therefore claimed as part of the present 
invention. 
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Other preferred degrees of identity to the indicated sequences are at least 
30%, 40%, 50%, 60%, 70%, 80%, 90% and 95%; and other preferred degrees of 
similarity are at least 50%, 60%, 70%, 80%, 90% and 95%. The inventors have 
used a computer program known as MegAlign®, DNASTAR® of DNASTAR Inc., 

5 1228 South Park Street, Madison, WI 53715, USA, for assessing homology. This 
program is based on the Clustal V algorithm (Higgins and Sharp (1998): A package 
for performing multiple sequence alignment on a microcomputer; GENE 73:237- 
244). For each gap introduced in the alignment, the program deducts a penalty 
from the score. A higher gap penalty suppresses gapping; a lower value promotes 

10 it. The program also assesses penalties based on the length of the gap. The more 
residues the gap spans, the greater the penalty. The program deducts these 
penalties from the overall score of the alignment. 

Stated more generally, the present invention relates to the isolation, 
purification and characterization of a diacylgiycerol acyltransferase (DGAT) gene 
15 from the Brassicaceae (specifically Arabidopsis thaliana) and demonstrates its utility 
in regulating fatty acid synthesis, seed oil content, diacylglycerol/triacylglycerol 
ratios and seed size/weight. Until now, no concrete data is available on the gene 
structure of plant DGATs, or their utility in altering oil content or composition through 
genetic manipulation. 

20 When considering altered oil contents or compositions, results from 

averages of statistically-significant numbers of plants or seeds according to the 
invention are best compared with results from averages of statistically-significant 
numbers of untransformed (control) plants or seeds of the same genotype grown 
under identical conditions at the same time. This allows for the variability of 

25 individual plants of the same genotype, particularly when such plants are grown 
under different conditions. The actual number of plants or seeds used to form the 
required average may vary, but should be enough to provide a generally constant 
average whenever such number is selected. Generally, the number should be at 
least 10, and is more preferably at least 20, 30, 50 or 100. 

30 The DGAT gene was cloned, characterized and authenticated from 

Arabidopsis by: (1 ) selection arid characterization of plant ESTs sharing some 
homology to mammalian acyl-CoA: cholesterol acyltransferases; (2) the functional 
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expression of a full-length cDNA in yeast; (3) the characterization and isolation of 
the DGAT (TAG1) gene from Arabidopsis mutant AS1 1 containing an insertion 
mutation in the DGAT gene and a seed oil phenotype which consists of an altered 
DAG/TAG ratio, and an altered oil content and acyl composition; (4) 
5 complementation of the AS1 1 mutant by insertion of the DGAT cDNA sequence to 
restore a wild-type fatty acid composition; (5) the over-expression of the DGAT 
cDNA in wild-type A. thaliana transgenic plants which produce seeds with an 
increased oil content, increased average seed weight and altered seed oil acyl 
composition. 

10 The A. thaliana DGAT structure is significantly homologous (over 40% 

identical over a region of more than 400 amino acids) to its mammalian 
counterparts, and is highly homologous to subsequently reported putative B. napus 
DGATs at both the nucleotide (92%) and the deduced amino acid (90%) levels 
(Nikytforuk et a), 1999; GenBank/EMBL Accession No AF155224; AF164434). 

1 5 The DGAT of the current invention is useful in manipulating DGAT activity, 

and triacyiglycerol bioassembly in plants. For example, by transforming plants with 
a construct containing the DGAT gene in a sense orientation, under the control of a 
tissue-specific promoter (e.g. seed-specific promoter napin), the expression of 
DGAT and accumulation of seed oil can be enhanced or the acyl composition of the 

20 seed oil altered. Yet another example would be to express the DGAT cDNA under 
the control of a constitutive promoter (e.g. 35S (Datla et al., 1993)) to increase the 
TAG content of vegetative tissues (leaves, roots, stems). This may have particular 
advantages for altering the starch/oil ratio in root crops. 

Alternatively, DGAT expression can be silenced to some degree by anti- 
25 sense or co-suppression (Transwrtch) phenomena (De Lange et al., 1995; Mol et 
al., 1990; Jorgensen and Napoli, 1994; Kinney, 1995; Vaucheret et al, 1998; Taylor, 
1998). For example, silencing DGAT in a seed specific manner may result in a 
reduction in TAG accumulation. This could have applications in reducing the oil 
content in seed barley to enhance stability during storage. As a second example, 
30 seed-specific silencing may lead to a relatively high accumulation of DAG or an 
increase in the DAG/TAG ratio in the developing or mature seed. As yet another 
example, the expression of the mutated DGAT gene which results in a 27 amino 
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acid repeat insertion in the mutant DGAT protein (See Figure 5 a) can be used to 
alter the DAG/TAG ratio in developing and mature seed. Such manipulations can 
lead to edible seed oils produced naturally in the plant, containing enhanced relative 
levels of DAG /reduced levels of TAG (See Figure 3; Table 1) to act as all-natural 

5 emulsifiers in the food and confections industries, or to enhance the 

nutritional/health profile of vegetable oils as functional foods (e.g. as cooking oils, 
stir fry oils, in salad dressings, margarines etc.) by inhibiting neutral fat deposition 
in humans. Processed oils produced from canola and soybean which contain 
increased proportions of diacylglycerol have been cited by the Kao Corporation of 

10 Japan (e.g. Econa Cooking Oil; Kao Corporation Kl, 1-14-10 Nihonbashi- 

Kayabacho. Chuoku, Tokyo 103 Japan; e-mail: 210064@kastanetkao.co.jp) as a 
product making it difficult for blood neutral fat to increase after a meal, and for fat to 
cling to the body, thereby assisting individuals who are overweight or who suffer 
from high neutral fat levels. As a third example, silencing or reducing the activity of 

15 DGAT in a seed specific manner (as observed in mutant AS1 1 ; e.g. by over- 
expressing SEQ ID NO:23 or silencing expression of SEQ ID NO:1 or SEQ ID 
NO:3), and combining this trait with the capacity to produce polyhydroxyalkanoates 
(PHAs; e.g. polyhydroxybutyrate) in seeds (Poirier et al., 1992; 1995) will allow an 
increased flow of unesterified fatty acids towards p-oxkJation (Poirier et al., 1999). 

20 By recycling or diverting the unesterified fatty acids into jJ oxidation, the resulting 
acetyl-CoA moieties will lead to a significant increase in polyhydroxyalkanoates 
(PHAs) or a change in PHA composition (Poirier et al., 1999). Transgenic plants 
producing PHAs in seeds have potential for utility as biodegradeable thermoplastics. 
However, up to now, the levels of PHAs produced have been relatively small 

25 (Poirier et al, 1992; 1995). The utility of transgenically reducing the DGAT activity to 
significantly enhance PHA production (e.g. 10-fold increase) in PHA-producing 
seeds is now possible, due to the current DGAT invention. 

Some of the manipulations and deliverables which are possible using the 
DGAT gene or a part therof, include, but are not limited to, the following: seeds with 
30 increased or decreased oil content; seeds containing oils with an enhanced 

diacylglycerol content, seed oils with an altered acyl composition; plants producing 
larger or heavier seeds; plants exhibiting an enhanced or altered capacity to 
accumulate storage compounds in other storage organs (e.g. tubers, roots). 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram illustrating the Kennedy pathway for the bioassembly 
of triacylglycerols in plants, and shows the critical rote played by DGAT as the final 
step of the Kennedy pathway. 

5 Figure 2 is a graph showing the comparison of the fatty acid composition of 

seed oil from wild-type (WT) and DGAT mutant AS1 1 lines of Arabidopsis thaliana. 
Proportions of fatty acids are reported as Mol % of the total fatty acid composition of 
the seed oil from each line. Error bars are ± SE (n= 10 plants of each line sampled, 
with 50 seeds per sample per analysis). 

10 Figure 3 is a graph showing a comparison of the DAG content in developing 

seed of wild-type (WT) and DGAT mutant AS1 1 lines of Arabidopsis thaliana. More 
specifically, it is a comparison of the fatty acid content of DAG pool in wild-type 
green developing seeds compared to that of the AS1 1 mutant. 

Figure 4 is a graph showing a comparison of the DGAT activity 
15 (pmol/min/mg protein) in developing seeds at the milky, early green, mid-green and 
brown-green stages of embryo development in wild-type (WT) and DGAT mutant 
AS1 1 lines of Arabidopsis thaliana. Developing seeds at each stage were selected 
and DGAT enzyme analyses conducted as described previously by Katavic et al., 
1995). 

20 Figure 5(a) shows the amino acid sequence alignment of the Arabidopsis 

DGAT (AtTAGI) [SEQ ID NO:2] with mammalian (mouse and putative human) 
DGATs. MDGAT, mouse DGAT [SEQ ID NO:4]; GenBank/EMBL Accession No. 
AF078752 (Cases et a/., 1998)]; HARGP1, human ARGP1 protein [SEQ ID NO:5]; 
GenBank/EMBL Accession No. AF059202; Oelkers et al. t 1998]. Dots indicate 

25 gaps. Identical amino acid residues are highlighted in black. Conserved residues 
are shaded. The 27-amino acid repeat found in Arabidopsis thaliana mutant AS1 1 
and generated by the insertion mutation (81 bp) found in SEQ ID NO:23 

(SHAGLFNLCVWLIAVNSRLHENLMK) is overlined thus: . The putative 

diacylglycerol binding site is overlined thus: — — — — . The SnRK1 targeting site is 

30 overlined thus: ====== with an asterisk (*) over the serine (S) residue as the 

targeted phosphorylation site. 
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Figure 5(b) shows the Kyte-Doolittle hydropathy plot of the DGAT protein. 

Figure 6(a) shows the results of a Northern analysis of TAG1 gene 
expression in Arabidopsis thaliana. Total RNA was extracted from roots (RT), 
leaves (LF) t flowers (FL), young seedlings (YS) t developing siliques (SL) ( and 
5 germinating seeds (GS). 

Figure 6(b) shows the results of Southern blot analysis of the TAG1 gene in 
Arabidopsis thaliana. Genomic QNA was digested with restriction enzymes Bglll 
(Lane 1), EcoRI (Lane 2), and Hindlll (Lane 3). The TAG1 DNA probe was *P 
labeled by random priming. 

10 Figure 7(a) is a diagrammatic representation of the TAG1 gene structure. 

The boxes indicate the 16 exons (solid boxes for coding regions, open box for 
untranslated regions), and the lines represent the 15 introns. A, B and C denote the 
positions of the primers used for PCR amplifications of the segments from wild type 
(WT) and AS1 1 . The specific primers A, B and C are described in Experimental 

15 Procedures: Primer Strategy (found later in this specification). 

Fig 7(b) shows gel separation of the PCR products amplified from wild type 
(WT) and AS1 1 . Lane 1 f PCR product with primers A and B using WT genomic 
DNA as template. Lane 2, PCR product with primers A and B using AS1 1 genomic 
DNA as template. Lane 3, PCR product with primers C and B using WT genomic 
20 DNA as template. Lane 4, PCR product with primers C and B using AS1 1 genomic 
DNA as template. Lane 5, RT-PCR with primers A and B using RNA prepared from 
WT seedling RNA. Lane 6, RT-PCR with primers A and B using RNA prepared 
from AS 11 seedling RNA. 

Figure 8 is a graph showing microsomal DGAT activity in Yeast Strain YMN5 
25 Transformed with empty plasmid (pYES Con) and with the A. thaliana DGAT cDNA 
(pYES:DGAT). This illustrates the expression of the TAG1 cDNA in yeast. Host 
cultures of strain YMN5 were transformed with pYES2 plasmid only (pYES2\ without 
TAG1 insert) or with pYES2 containing the TAG1 cDNA insert (pYES2:TAG1). 
Following induction in the presence of galactose, transformants were lysed and 
30 assayed for DGAT activity as described in the Experimental Procedures. The 
results of two separate DGAT activity experiments are shown. 
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Figure 9 is a map of plasmid pSE129A which may be used as a vector. The 
vector contains the following salient features for plant transformation in the current 
invention: the seed-specific napin promoter and NOS terminator between which is a 
multiple cloning site. 

5 Figure 1 0 is a graph showing the complementation of the AS1 1 DGAT 

mutation with the wild-type cDNA. Transformation of Arabidopsis thaliana mutant 
line AS1 1 with the DGAT cDNA [SEQ ID NO:1] under the control of a napin 
promoter, leads to a restoration of the wild-type (WT) fatty acid composition in the 
seed oil of the transformant lines 3-4, 9-1, 14-2 and 9-4. Fatty acid composition (wt 

10 %) was determined on the seed oil extracted from 100-seed samples from A. 
thaliana non-transformed controls (WT), mutant line AS1 1 , and T 2 seeds of 
napin:DGAT transgenic lines. 

Figure 1 1 is a graph showing the seed oil content of non-transformed WT 
control, and pRD400 control (empty plasmid) and napinrDGAT T 2 transgenic 

15 Arabidopsis thaliana seed lines. More particularly, the graph shows the 

transformation of wild type (WT) Arabidopsis thaliana with the DGAT cDNA [SEQ ID 
NO:1] under the control of a napin promoter, leads to a higher seed oil content in 
the DGAT transgenic lines. Oil content is expressed as ^g total fatty acids (FAs) per 
100 seeds from A. thaliana non-transformed controls (WT Con), and T 2 seeds of 

20 pRD400 control (empty plasmid) transgenic, and napin:DGAT transgenic lines 1\ 2\ 
9, 10 and 11. Standard error bars for the control lines are indicated; n=10. 

Figure 12 is a graph showing the average 100-seed weight of non- 
transformed WT control, and pRD400 control (empty plasmid) and napin:DGAT T 2 
transgenic Arabidopsis thaliana seed lines. More specifically, the graph shows the 
25 over-expression of the DGAT cDNA under the control of a napin promoter, in wild- 
type (WT) Arabidopsis thaliana leads to a higher average seed weight. The average 
weight of 100-seed samples from A thaliana non-transformed controls (WT Con), 
and T 2 seeds of pRD400 control (empty plasmid) transgenic, and napin:DGAT 
transgenic lines 1\ 2\ 9, 10 and 1 1 are reported as mg dry weight (DW). 

30 Figure 13 is a graph showing the positive correlation between oil content 

(expressed as \ig Total fatty acids (FAs) per 100 seeds) and average seed weight 
(expressed as average mg DW per 100-seed samples) from A. thaliana non- 
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transformed controls (WT Con ♦), and 12 seeds of pRD400 control {empty 
plasmid) transgenic {() and napin:DGAT transgenic lines 1\ 2\ 9, 10 and 1 1 ( ■ ) are 
reported as mg dry weight (DW). 

BEST MODES FOR CARRYING OUT THE INVENTION 

5 Fig. 1 is a diagram illustrating the Kennedy pathway for the biosynthesis of 

TAGs in plants. Of the various illustrated enzymes, DGAT is the only enzyme in 
the Kennedy pathway that is exclusively committed to TAG biosynthesis, and its key 
role is apparent from the scheme of Figure 1 . sn-1 ,2-DAG, generated as a result of 
either the catalytic action of PA phosphatase (EC 3.1.3.4) or CPTase (EC 2.7.8.2), 

10 can be used in the biosynthesis of TAG. 

For this reason, the inventors of the present invention decided to investigate 
DGAT to see if the corresponding gene in plants could be sequenced and cloned 
and used to modify the seed oil content and fatty acid composition of plants in a 
way that could be commercially useful. 

15 The acyl-CoA dependent acylation of sn-1 ,2-DAG is catalyzed by DGAT 

(Stymne and Stobart, 1987). In developing and germinating seeds of oilseed 
plants, TAG accumulation and DGAT activity have been shown to associate with the 
endoplasmic reticulum (ER; high speed microsomal fraction) (Stobart et al., 1986; 
Cao and Huang, 1986; Stymne and Stobart, 1987; Frentzen, 1993; Settlage et al, 

20 1995; Lacey and Hills, 1996). The biochemical properties of microsomal DGAT have 
been examined in a number of plant systems {Frentzen, 1993) including developing 
seeds (Bernerth and Frentzen, 1990; Vogel and Browse, 1996; Cao and Huang, 
1987) and embryo cultures (Taylor et al., 1991 ; Weselake et al M 1991; Taylor et al., 
1992; Little et al., 1994) of B. napus L. In general, studies with developing seeds 

25 indicate that DGAT activity increased rapidly during the active phase of oil 

accumulation and then decreases markedly as seed lipid content reaches a plateau 
(Tzen et al., 1993; Weselake et al., 1993). 

A number of studies with both mammalian (Mayorek et al., 1989; Tijburg et 
al., 1989) and plant (Ichihara et al., 1988; Perry and Harwood, 1993 a and 1993 b; 
30 Settlage et al. 1995) systems have suggested that DGAT may catalyze a rate- 
limiting reaction in TAG bioassembly. However, this hypothesis has not been 
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rigorously tested, and has not been reduced to practice by transgenic expression of 
any DGAT gene in plant or animal systems, until now. Developing seeds of 
B. napus L, cv Shiralee, have been shown to produce significant levels of DAG 
during the active phase of oil accumulation suggesting that DGAT catalyzed 

5 reaction may regulate the flow of carbon into TAG (Perry and Harwood, 1993 a and 
1993 b). In addition, an ethyl methanesulfonate-induced (EMS-induced) mutant of 
A. thatiana, designated AS11, has been shown to have a reduced DGAT activity 
that correlated with both an increased DAG pool and decreased accumulation of 
TAG (Katavic et al. 1995). Given its possible rate-limiting role in TAG bioassembly, 

10 the inventors of the present invention have identified DGAT as a potential target in 
the genetic modification of plant lipid biosynthesis. 

Previously, the partial characterization of an EMS-induced Arabidopis 
thatiana mutant, AS1 1, with altered fatty acid composition was reported (Katavic et 
a/., 1995). In comparison to wild type plant seeds, AS11 seeds have reduced levels 

15 of the very long chain fatty acid eicosenoic acid (20:1 ) and reduced oleic acid (1 8:1 ) 
and accumulate a-linolenic acid (18:3) as the major fatty acid in triacylglycerols 
(Figure 2). The AS1 1 mutant has a consistently lower ratio of TAG/DAG in 
developing seeds, and it accumulates an elevated amount of seed DAG (Figure 3), 
the substrate of the diacylglycerol acyltransferase. Through a series of biochemical 

20 analyses, it was shown that AS1 1 had reduced diacylglycerol acyltransferase 
activities throughout seed development (Figure 4). AS11 also had a reduced (by 
25-30%) oil content phenotype, providing some evidence that DGAT may be 
controlling flux into TAG biosynthesis, as shown in Table 1 below. The AS1 1 did not 
have a wrinkled-seed phenotype as described in other low-seed-oil mutants (Focks 

25 and Benning, 1998). 
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Table 1 


Comparison of AS1 1 [Katavic et al. t (1995) and wild-type A. thaliana seeds 
with respect to lipid profiles at mid-development, and the relative TAG, DAG and 
sterol ester contents in AS1 1 and WT seeds at maturity. 


Seed Type 

TAG/DAG ratio 

at mid- 
development 3 

TAG/DAG ratio 
at maturity 3 

Relative TAG 
content at 
maturity 6 
c (nmol/mg DW) 

Sterol Esters at 
maturity (% of 
Totol Lipid 
Extract)" 

WT 

17 

90 

1.00" 

0.8 




e (255) 


AS11 

5 

20 

0.6 b 

1.15 


c (174) 


8 Embryos staged and lipids measured as described in Katavic et ah, (1995); 
b Relative TAG content of 200-seed samples of AS1 1 and WT were meaured by MASS-1H- 
NMR according to the method of Rutar (1989). The integration response for resonances 
attr&utabe to liquid-like oil were summed and the value for AS1 1 seed is reported relative to 
the response for the WT control seed sample (the latter set at a value of 1 .00); C TAG content 
(nmoles/mg DW) measured by transmethylation of a TLC-purified TAG fraction, followed by 
GC analysis of fatty acid methyl esters; d A total lipid extract was prepared as dedscribed by 
Taylor et al., (1991 ; 1 992), and sterol esters isolated and characterized as described in the 
Experimental Procedures. 

Genetic analysis indicated that the fatty acid phenotype is caused by a 
semidominant mutation in a nuclear gene, designated TAG1. The mutation was 
mapped to chromosome II, and was estimated to lie in the region approximately 
17.5 + 3 cM from the sti locus and 8 + 2 cM from the cp2 locus. 

5 Because a DGAT gene has not heretofore been cloned from any plant, until 

now, it has not been possible to address the possibility of genetic modifications to 
alter carbon flux, increase fatty acid synthesis, oil content, oil acyl composition, or 
seed size, by modulating plant DGAT activity. 

However, there are many examples of successful modifications to plant 
10 metabolism that have been achieved by genetic engineering to transfer new genes 
or to alter the expression of existing genes, in plants. It is now routinely possible to 
introduce genes into many plant species of agronomic signtflcance to improve crop 
performance (e.g. seed oil or tuber starch content/composition; meal improvement; 
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herbicide, disease or insect resistance; heavy metal tolerance etc.) (MacKenzie and 
Jain, 1997; Budziszewski et al., 1996; Somerville, 1993; Kishore and Somerville, 
1993). 

For example, increases in the proportions of some strategic fatty acids and 
5 in the quantities of seed oil have been achieved by the introduction of various fatty 
acid biosynthesis and acyttransferase genes in oilseed crops. These include the 
following demonstrations: Expression of an anti-sense construct to the stearoyl- 
ACP A9 desaturase in Brassicaceae led to an increase in the stearic acid content 
(Knutzon et al., 1992). Expression of a medium chain fatty acyl-ACP thioesterase 
10 from California Bay, in Brassicaceae was demonstrated to increase the lauric acid 
(12:0) content (Voelker et al., 1992; 1996). Expression of a Jojoba p keto-acyl-CoA 
synthase in low erucic acid Brassicaceae led to an increase the level of erucic acid 
(22:1); the effect following expression in high erucic acid cultivars was negligible 
(Lassner et al., 1996). Increased proportions of oleic acid in Brassica napus and in 
15 soybean have been achieved by silencing the microsomal FAD2 (A12) desaturase 
(Hitz et al., 1995; Kinney, 1995 ; 1997). Transformation of Arabidopsis thaliana and 
rapeseed (B. napus) with a yeast sn-2 acyltransferase resulted in seed oils with 
increased proportions of 22:1 and other very long-chain fatty acids and significant 
increases in seed oil content (Zou et al., 1997). 

20 Starch deposition has also been altered by genetic engineering. By 

expression of a mutant E. coli glgC16 gene encoding an ADP glucose 
pyrophosphorylase in potato tubers, an increase in starch accumulation was 
achieved (Stark et al., 1992). 

The inventors therefore considered the DGAT gene to hold great promise for 
25 the desired modification of TAGs in plants. 

The best modes for carrying out the invention will be apparent from the 
following description of the results of tests and experiments that have been carried 
out by the inventors. 

The inventors chose to use the well-accepted model plant system 
30 Arabidopsis thaliana for the cloning of DGAT, as a host system for genetic 

engineering to alter DGAT expression, and to study the effects of altering DGAT 
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expression on seed triacylglycerol bioassembty. This is because, over the past 
several years, Arabidopsis thaliana, a typical flowering plant, has gained increasing 
popularity as a model system for the study of plant biology. As a result of the ease 
with which this plant lends itself to work in both classical and molecular genetics, 

5 Arabidopsis has come to be widely used as a model organism in plant molecular 
genetics, development, physiology and biochemistry (Meyerowitz and Chang, 1985; 
Meyerowitz, 1987; Goodman et al., 1995). This model dicotyledonous plant is also 
closely related to Brassica crop species and it is increasingly apparent that 
information concerning the genetic control of basic biological processes in 

10 Arabidopsis will be transferable to other species (Lagercrantz et al, 1 996). 

Indeed, there are numerous examples wherein studies of the molecular 
biology and biochemistry of a particular metabolic pathway or developmental 
process and the possibility of genetically engineering a plant to bring about changes 
to said metabolic pathway or process, has first been tested in the model plant 
15 Arabidopsis, and then shown to yield similar phenotypes in other plants, particularly 
crop plants. 

For example, the extra- plastidlal membrane associated oleate (18:1) A12 
(o>-6) desaturase gene, FAD2, was originally studied and eventually cloned from 
Arabidopsis thaliana, by identifying the lesion found in an A. thaliana mutant 

20 defective in desaturating oleate to produce linoleate (18:2) on the 

phosphatidylcholine backbone. This resulted in a high oleic acid phenotype in the A. 
thaliana seed oil (Okuley et al., 1994). Genetic engineering of both soybean 
(Glycine max.) and canota B. napus to silence the indigenous FAD2 gene(s) in a 
seed-specific manner by anti-sense or co-suppression approaches, resulted in 

25 similar high oleic acid seed oil phenotypes (Kinney, 1995; 1997). 

Transgenic expression of a yeast sn-2 acyltransferase (SLC1-1) gene to 
achieve enhanced seed oil and very long-chain fatty acid content was first 
performed in Arabidopsis and later shown to yield similar phenotypes in transgenic 
rapeseed (B. napus) experiments (Zou et al., 1997). Arabidopsis thaliana has 
30 repeatedly shown itself to be a useful model system for metabolic engineering of 
metabolic pathways (e.g. lipid biosynthesis, photosynthesis) or processes 
(organogenesis, reproductive development etc.) common to all higher plants. 
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In the area of secondary metabolism/signal transduction, an anthocyanin 
pathway-specific transcriptional activator from the monocot maize designated as R 
(the myc transcription factor involved in activation of biosynthetic genes for 
anthocyanin production in the aleurone cells of maize kernels,), was expressed in 

5 the dicot Arabidopsis, causing augmented anthocyanin pigmentation in the 
infloresecences. Subsequent expression in another dicot, tobacco (Nicotiana 
tabacum), resulted in similar floral pigmentation changes (Lloyd et al., 1992). These 
experiments demonstrate that whole pathways common to all flowering plants can 
be co-ordinately controlled through the introduction of transcriptional regulators, and 

1 0 that the mechanisms are common to diverse plant species. 

In the context of the current invention, all plant seeds accumulate some 
triacylgtycerol (oil) and this ubiquitous process is affected, at least in part, by the 
activity of a microsomal DGAT, as explained previously. Thus, many of the effects 
observed following genetic engineering to modulate DGAT expression in 

15 Arabidopsis can be expected and predicted to result in similar phenotypes when 
carried out in all other plants. For example, after the present invention was made, 
information has become available that supports the findings of the present inventors 
by showing that B. napus has a highly homologous DGAT gene (Nikiforuk et al., 
1999), and thus B. napus is a clear target for similar genetic modifications as those 

20 shown for A. thaliana. 

There are a number of ways by which genes and gene constructs can be 
introduced into plants, and a combination of plant transformation and tissue culture 
techniques have been successfully integrated into effective strategies for creating 
transgenic crop plants. These methods, which can be used in the present 

25 invention, have been extensively reviewed elsewhere (Potrykus, 1991; Vasil, 1994; 
Walden and Wingender, 1995; Songstad et al., 1995), and are well known to 
persons skilled in the art. For example, one skilled in the art will certainly be aware 
that, in addition to ^grobacterium-mediated transformation of Arabidopsis by 
vacuum infiltration (Bechtold et al., 1993) or wound inoculation (Katavic et al., 

30 1994), it is equally possible to transform other plant and crop species, using 

Agmbacterium Ti-plasmid-mediated transformation (e.g. hypocotyl (DeBlock et al., 
1989) or cotyiedonary petiole (Moloney et al, 1989) wound infection), particle 
bombardment/biolistic methods (Sanford et al., 1987; Nehra et al., 1994; Becker et 
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al., 1994) or polyethylene glycol-assisted protoplast transformation (Rhodes et al. t 
1988; Shimamoto et al. ( 1989) methods. 

As will also be apparent to persons skilled in the art, and as extensively 
reviewed elsewhere (Meyer, 1995; Datla et al., 1997), it is possible to utilize plant 

5 promoters to direct any intended up- or down-regulation of transgene expression 
using constitutive promoters (e.g. those based on CaMV35S), or by using 
promoters which can target gene expression to particular cells, tissues (e.g. napin 
promoter for expression of transgenes in developing seed cotyledons), organs (e.g. 
roots), to a particular developmental stage, or in response to a particular external 

10 stimulus (e.g. heat shock). 

Particularly preferred plants for modification according to the present 
invention include Arabidopsis thaliana, borage (Borago spp.), Canola, castor 
(Ricinus communis), cocoa bean (Theobroma cacao), corn (Zea mays), cotton 
(Gossypium spp), Crambe spp., Cuphea spp. ( flax (Linum spp.), Lesquerella and 

15 Limnanthes spp., Linola, nasturtium (Tropaeolum spp.), Oenothera spp., olive (O/ea 
spp.), palm {Elaeis spp,), peanut (Arachis spp.), rapeseed, safflower (Carthamus 
spp.), soybean (Glycine and Soja spp.) ( sunflower {Helianthus spp.), tobacco 
(Nicotiana spp.), Vernonia spp., wheat (Triticum spp.), barley (Hordeum spp.), rice 
(Oryza spp.), oat (Avena spp.) sorghum (Sorghum spp.), rye (Secale spp.) or other 

20 members of the Gramineae. 

The present invention is particularly useful when used to modify the yield or 
composition of oilseed produced from oilseed crops. Oilseed crops are plant 
species that are capable of generating edible or industrially useful oils in 
commercially significant yields, and include many of the plant species listed above. 
25 Such oilseed crops are well known to persons skilled in the art. 

RESULTS 

Isolation of the TAG1 (DGAT) cDNA from Arabidopsis thatiana 

Since one of the most likely defects in AS1 1 mutant is at the DGAT itself 
(Table 1 ; Fig 4), the inventors attempted cloning strategies based on sequence 
30 information of enzymes that share common substrates with DGAT. One of the 
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candidate enzymes that would serve this purpose is the acyl-CoA: cholesterol 
acyltransferase (ACAT, EC 2.3. 1 .26) (Chang et a/. , 1 997), Like DGAT, ACAT is an 
ER protein functioning as an O-acyltransferase by using acyt-CoA as the fatty acyl 
donor for the esterification of free cholesterol to generate sterol esters. Through a 
5 BLAST database search, the inventors identified an Arabidopsis thaliana expressed 
sequence tag (EST) [accession no. AA042298; SEQ ID NO:6J with a deduced 
amino acid sequence showing 41 % identity to that of the yeast acyt-CoA: 
cholesterol acyltransferase (Yang et a/., 1996, Yu eta/., 1996), within the short 
sequence (104 amino acids) that was available for the EST. 

1 0 The corresponding cDNA (E6B2T7) clone was obtained from the 

Arabidopsis Biological Resource Center, Columbus, Ohio. Upon complete 
sequencing, the 878 bp E6B2T7 clone was found to be a partial cDNA. However, 
the ORF prediction from this partial cDNA confirmed the initial EST search results in 
that the encoded product is structurally similar to ACAT, especially in the regions at 

15 the Oterminus. The inventors were confident that the cDNA contained the 3 1 
untranslated region through an ORF search, although the polyA tail was missing. 

The inventors further used the partial cDNA sequence to search against 
Arabidopsis thaliana genomic sequence information. An Arabidopsis 'IGF BAC 
clone 'F27F23' [accession no. AC003058] was identified to include a region that 

20 matched the cDNA, and therefore it was concluded that this was the region 
encompassing the corresponding gene. Moreover, this BAC clone , F27F23\ is 
contained in the YAC clone, CIC06E08, which, according to the published map 
position (http://weeds.mgh.harvard.edu/goodman/c2_b.html), represents a region 
between centimorgan 35.9 and centimorgan 38.7 on chromosome II; this position is 

25 similar to the estimated location for TAG1, and the lesion identified by the mutation 
in AS1 1 (Katavic et a/., 1 995). In view of our previous results on the 
characterization of the AS 11 mutant, the map position of this gene strongly 
suggested that it may encode a DGAT. 

To clone a full-length cDNA, a series of oligonucleotide primers were 
30 designed, based on the genomic sequences located at different positions 5 
upstream of the region covering the partial cDNA. We used these primers in 
combination with a primer located at the 3' UTR of the partial cDNA (E6B2T7) to 
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perform PCR reactions with cDNA phagemid prepared from an Arabidopsis thaliana 
(ecotype Columbia) silique-specific cDNA library (Giraudat et al., 1992) as a 
template. The longest cDNA amplified was 1904 bp, which we subsequently 
designated as TAG1, and deposited into the Genbank under accession AJ238008 

5 [SEQ ID NO:1]. We believe this cDNA represents a full-length clone because its 
size is in agreement with that of the transcript detected in the northern blot (see 
Figure 6 a). The longest open reading frame is flanked by a 134-nucIeotide 5* 
untranslated region and a 203-nucleotide 3' untranslated region. There is an in- 
frame stop codon (TGA at position nt-43) which is followed by an in-frame ATG at 

10 position nt-1 39. It is thus inferred that the ATG at position nt-1 39 is the initiation 
codon. 

The primary structure of TAG1 predicts a DGAT-related enzyme 

The predicted open reading frame of the TAG1 cDNA encodes for a 
polypeptide of 520 amino acids with a calculated molecular weight of 58993 

15 Daltons. With the BLAST search program (Altschul et a/. 1990), it was found that 
the recently reported mouse diacyiglycerol acyltransferase [accession no. 
AF078752] (Cases etaL, 1998) is a protein which showed the highest sequence 
similarity to the deduced amino acid sequence of TAG1 (Figure 5a). TAG1 was 
also similar to a human acyl CoA: cholesterol acyltransferase-related enzyme 

20 [accession no. AF059202]. The human acyl CoA: cholesterol acyltransferase- 
related enzyme, also known as ARGP1 , is most likely to be a DGAT with no 
significant ACAT activity, although the true nature of the enzyme awaits further 
confirmation (Oelkers et a/., 1998). The similarity between TAG1 and the 
mammalian DGAT extends over a region of more than 400 amino acids with a 

25 sequence identity of about 41%. A putative diacylgiycerol/phorbol ester-binding 
motif, HKW-X-X-RH-X-Y-X-P, a signature sequence observed to be unique to 
DGAT while absent in the ACATs (Oelkers et a/., 1998), is located at amino acids 
414-424 ([SEQ ID NO: 7]; Figure 5 a). This diacyiglycerol binding motif is also found 
in the subsequently published S. napus DGAT sequences (Nikyiforuk et al, 1999; 

30 GenBank /EMBL Accession Nos. AF1 55224, SEQ ID NO:8; AF164434, SEQ ID 
NO:9). Among other cloned acyltransferases (e.g. GPATs, LPATs, 
dihydroxyacetone phosphate acyltransferases) it has been reported that there is an 
invariant proline in a highly hydrophobic block IV that may participate in acyl-CoA 
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binding (Lewin et al., 1999). In the TAG1 sequence, the hydrophobic block from 
residues 221-229 containing an invariant proline at residue 224, might constitute 
such a motif. 

TAG1 showed some sequence similarity to other acyi CoA: cholesterol 
5 acyltransferases from a number of species (Chang et a/., 1997). However, the 
similarity is largely confined to the C-terminus and is lower (around 30%) than is the 
similarity of TAG1 to the mammalian DGAT. 

The TAG1 protein has multiple hydrophobic domains (Figure 5b) and an 
analysis by the PC Gene program predicted that the protein has 5 possible 

10 transmembrane segments (amino acids 178-195, 233-253, 363-388, 433-476, 486- 
507). In the mammalian DGAT, a putative tyrosine phosphorylation motif was 
observed (Cases etai, 1998), but no apparent tyrosine phosphorylation site could 
be found in TAG1. However, a visual examination revealed a consensus sequence 
(X-L^-X-K^X-X-S^-X-X-X-V 209 ; SEQ ID NO:10) identified as a targeting motif 

15 typical of members of the SnRK1 protein kinase family, with serine residue 205 
being the residue for phosphorylation. The SnRK1 (SNF1-related protein kinase-1) 
proteins are a class of Ser/Thr protein kinases that have been increasingly 
implicated in the global regulation of carbon metabolism in plants (Halford and 
Hardie, 1998). This concensus SnRK1 targeting motif is also found in the 

20 subsequently published B. napus DGAT sequences (Nikyiforuk et al, 1999; 
GenBank /EMBL Accession Nos. AF1 55224; AF1 64434). Interestingly, similar 
SnRK1 targeting motifs could also be identified in the (yso-phosphatidic add 
acyltransferases (LPATs) from coconut (Knutzon et a/ M 1995) and meadowfoam 
(Lassner eta/ M 1995), respectively. 

25 The TAG1 gene is ubiquitously expressed in Arabidopsis 

Northern blot analyses were performed to investigate the expression profile 
of the TAG1 gene. Total RNA was extracted from different tissues, including roots, 
leaves, flowers, developing siliques, young seedlings and germinating seeds. The 
highest steady-state level accumulation of TAG1 transcript was in RNA isolated 
30 from germinating seeds and young seedlings (Figure 6a). TAG1 transcripts were 
also detected in root, leaf and flower tissues, albeit at lower levels. Surprisingly, the 
TAG1 gene is expressed in developing siliques at a level that is comparable to that 
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of other vegetative tissues, but lower than that of germinating seeds and young 
seedlings. This expression profile in general is not inconsistent with the notion that 
DGAT is present in all plant tissues capable of TAG biosynthesis (Kwanyuan and 
Wilson, 1986). It has been shown in a number of plant species, including soybean 
5 and safflower, that germinating seeds actively synthesize TAGs (Ichihara and Noda, 
1981; Kwanyuan and Wilson, 1986; Wilson and Kwanyuan, 1986). The relatively 
high level of expression in roots is also consistent with the fact that root plastids are 
capable of synthesizing large amounts of triacylglycerol (Sparace et al. a 1992). 

Southern blot hybridization (Southern, 1975) was performed with genomic 
10 DNA digested with several restriction enzymes including Bglll, EcoRI and Hindlll. 
The TAG1 gene has no internal Bglll and Hindlll site, while one internal EcoRI site 
exists. Our Southern analysis suggested that TAG1 most likely represents a single 
copy gene in the Arabidopsis genome (Figure 6b). 

An insertion mutation is found in the TAG1 gene in mutant AS11 

15 Alignment of the genomic sequence (accession no. AC003058; SEQ ID 

NO:3) with that of the TAG1 cDNA [SEQ ID NO:1] revealed that the TAG1 gene 
contains 16 exons and 15 introns, spanning a region of about 3.4 kb (Figure 7a). 
DNA containing the TAG1 allele from AS11 was PCR-ampIified and completely 
sequenced. The AS1 1 TAG 7 allele has a 147-bp insertion located at the central 

20 region of intron 2. The insertion is a duplication of a segment that is composed of 
12 bp from the 3* end of intron 1 , the entire sequence of exon 2 (81 bp) and 54 bp 
from the 5' end of intron 2 (Figure 7a). 

In order to rule out the possibility of PCR artifacts, two sets of primers were 
used to perform further PCR amplifications. Primers A and B (see Experimental 

25 Procedures, Primer Strategy) located in exons 1 and 3, respectively, amplified a 
DNA fragment that is about 150 bp longer from AS1 1 (Figure 7b, lane 2) than that 
from the wild type (Figure 7b, lane 1). The second pair of primers, C and B 
(Experimental Procedures), with one to be found in both exon 2 and the insertion 
segment, and the other located in exon 3, generated two amplified fragments from 

30 AS1 1 (Figure 7b, lane 4), while only one from the wild type (Figure 7b, lane 3). 
Hence these results confirmed that the insertion mutation the inventors identified 
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through sequencing, reflected the true nature of the mutation in the TAG1 gene in 
the AS1 1 genome. 

The AS1 1 TAG1 transcript has an 81-bp insertion in its open reading frame 

Northern blot analyses indicated that there was no difference in the 

5 expression profiles of the TAG1 gene, between the AS1 1 mutant and wild type A. 
thaliana. In order to investigate the effect of the mutation at the transcript level, 
reverse-transcription PCR (RT-PCR) was performed to amplify the TAG1 transcript 
from RNA extracted from germinating seedlings of mutant AS1 1 . Sequencing 
analysis revealed that there is an 81 -bp insertion composed entirely of exon 2 in the 

10 transcript from AS1 1 . The exon 2 in the repeat is properly spliced. The alteration of 
the transcript thus does not disturb the reading frame. However, this additional 
exon 2 sequence in the AS1 1 transcript would result in an altered DGAT protein 
with the 27 amino acid insertion 131 SHAGLFNLCVWLIAVNSRLIIENLMK 157 [SEQ ID 
NO:11]. The inventors 1 data shows that this insertion results in a 40-70% reduction 

15 in DGAT activity throughout seed development (Katavic et al. v 1995). The 81 bp 
insert responsible for reduced DGAT activity in AS11 is visible in the comparison of 
RT-PCR products (Figure 7b: Compare lane 5 (WT) and lane 6 (AS11).) The DMA 
aberration observed in the AS1 1 mutant was unexpected, since ethyl 
methanesutfonate (EMS) generally causes point mutations. Although we cannot 

20 rule out the possibility that this AS1 1 mutant was the result of a spontaneous 

mutation event, EMS-induced deletions and insertions have been reported in other 
systems (Mogami ef a/., 1986, Okagaki etaL, 1991) 

7?7e TAG1 gene insertion in Arabidopsis mutant AS11 affects seed 
triacylglycerol accumulation, but not sterol ester accumulation in seeds. 

25 Because TAG1 also showed some sequence homology to acyl CoA: 

cholesterol acyttransferases (ACATs) from a number of species (Chang et a/., 
1997), the inventors compared both triacylglycerol and sterol ester accumulation in 
seeds of the wild-type A. thaliana and AS1 1 mutant. While the triacylglycerol 
content and TAG/DAG ratios were reduced in AS1 1 (i.e. increased proportion of 

30 seed oil DAGs.) in contrast, the proportions of sterol esters in WT and AS1 1 seeds 
were similar, at 0.8 and 1% of the total lipid extract, respectively (Table 1). If the 
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TAG1 lesion affected ACAT-like activity, one might expect a reduction in seed sterol 
esters, but this was not observed. These results indicated that TAG1 is not involved 
in steroi-ester homeogenesis, and thus not an acyl CoA: sterol acyltransferase. 

TAG1 expression in yeast 

5 The TAG1 cDNA overexpressed in yeast resulted in a 3.5 to 4-fold increase 

in microsomal DGAT activity compared to plasmid only (pYES2) control 
transformants (Fig. 8), confirming that the TAG1 gene product functions as a DGAT. 
When 14 C18:1-CoA was added to the yeast lysates, sterol esters were also labeled 
in vitro (data not shown), but there was no significant difference in the 14 C-labeted 

10 sterol esters produced by lysates from the pYES2 GAL-induced control and the 
pYES2:TAG1 Gal-induced transformant. This confirms that the TAG1 product does 
not encode an acyl-CoA: sterol acyltransferase (like ACAT). 

Complementation of the A. thaliana AS1 1 Mutant Line by Transformation 
with the DGATcDNA. 

15 The cloned full-length DGAT cDNA was used as a template for PCR 

amplification with the primers DGATXbal (CTAGTCTAGAATGGCGATTTTGGA; SEQ 
IN NO: 12) and DGATXhol (GCGCTCGAGTTTCATGACATCGA; SEQ ID NO:13) to 
provide new restriction sites on each end of the sequence as described in 
Experimental Procedures. A 1.6kb fragment was excised by a Xbal/Kpnl digestion 

20 and ligated into the corresponding sites of the pSE129 vector (provided by Dr, P. 
Covello, PBI/NRC). pSE129A is a vector derived from the plant transformation 
vector pRD400 (Datta et al. 1992). The vector pSE129A contains the seed-specific 
napin promoter and the nos terminator cloned into the EcoRI and Hindlll sites of the 
pRD400 plasmid (Figure 9). Hence in the DGAT-pSE129A construct, the 

25 Arabidopsis DGAT cDNA is under the control of the napin promoter. The construct 
integrity was confirmed by sequencing. 

The pSE129A containing the napinrDGATcDNA was introduced into A. 
tumefaciens, used to transform A, thaliana mutant AS11, and progeny analyzed as 
described in Experimental Procedures. A number of T 2 transgenic lines were 
30 isolated which complemented the fatty acid mutant phenotype found in AS1 1 

(reduced 20:1 and elevated polyunsaturated C 18 s), restoring the wild-type seed fatty 
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add profile (Figure 10). This finding confirms the nature of the lesion in AS1 1 and 
directly ties the AS1 1 lipid phenotype to this mutation. 

Over-Expression of the DGATcDNA in Wild-Type A. thaliana 

The cloned full-length DGAT cDNA was used as a template for PCR 
5 amplification with the primers DGATXbal (CTAGTCTAGAATG6CGATTTTGGA ; SEQ 
ID NO:12) and DGATXhol (GCGCTCGAGTTTCATGACATCGA; SEQ ID NO:13)to 
provide new restriction sites on each end of the sequence as described in 
Experimental Procedures. A 1.6kb fragment was excised by a Xbal/Kpnl digestion 
and ligated into the corresponding sites of the pSE129 vector (provided by Dr. P. 
10 Coveilo, PBI/NRC). pSE129A is a vector derived from the plant transformation 
vector pRD400 (Datla et al. 1992). The vector pSE129A contains the seed-specific 
napin promoter and the nos terminator cloned into the EcoRI and Hindlll sites of the 
pRD400 plasmid (Figure 9). Hence in the DGAT-pSE129A construct, the 
Arabidopsis DGAT cDNA is under the control of the napin promoter. The construct 
1 5 integrity was confirmed by sequencing. 

The pSE129A containing the napin: DGATcDNA was introduced into A. 
tumefaciens, used to transform wild-type A thaliana, and progeny analyzed as 
described in Experimental Procedures. A number of T 2 transgenic lines were 
isolated which exhibited an increased oil content (Figure 1 1) an increased average 
20 100-seed weight (Figure 12) and a strong linear correlation between the two traits 
(Figure 13). 

In terms of fatty acyl composition, wild type lines contaning over-expressed 
DGAT cDNA showed a decrease in the total saturates, and increases in the 
monounsaturates and in the 18:1/[18:2 + 18:3] index, as shown in Table 2 below. 
25 Such changes are all towards a 'healthier* oil profile, and can be applied directly to 
canola, other oilseeds in the Brassicaceae and other edible oil crops to produce 
similar oil composition improvements. 
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Table 2 

Fatty acid composition of seed oil from A thaliana non-transformed wild-type 
controls (WT Con) and three T2 transgenic lines (2\ 9 and 11) of wild type 
transformed with the DGAT cDNA under the control of a napin promoter 

(napin: DGAT). 


Line 

Total Saturates 3 
Wt% 

Monounsaturates" 
Wt% 

18:1/118:2 + 18:3] 
index' 

WT Control 

15.1+0.1 

36.7 + 0.2 

29.9 + 0.6 

2' napin:DGAT 

13.4 

38.6 

34.1 

9 napin: DGAT 

13.1 

39.3 

35.6 

11 napin: DGAT 

13.1 

38.3 

33.0 


a Includes 16:0, 18:0, 20:0, 24:0 
includes 18:1. 20:1, 22:1, 24:1 
c ((Wt % 18:1] + [Wt % 18:2 + Wt % 18:3]) x 100 


EXPERIMENTAL PROCEDURES 
Plant Material 

5 Arabidopsis thaliana ecotype Columbia and mutant AS1 1 were grown under 

conditions described previously (Katavic et aL, 1995). The A. thaliana mutant line 
AS11 was generated and characterized relative to wild type (WT) A. thaliana 
ecotype Columbia, as described by Katavic et al., (1995); (ATCC NO: PTA-1013). 

DNA manipulation 

10 Standard methods and procedures were used for DNA preparation, plasmid 

propagation and isolation (Sambrook et a/., 1989). Sequencing was conducted on 
an Applied Biosystems Model 373A DNA Sequencing System using the Taq 
DyeDeoxy™ Terminator Cycle Sequencing Kit (Applied Biosystems, Inc.). The 
nucleotide and the deduced amino acid sequences were compared with sequences 

15 available in databanks using the BLAST program (Altschul ef aL, 1990). 


SUBSTITUTE SHEET (RULE 26) 


WO 00/361 14 


PCT/CA99/01202 


26 

Southern and Northern analysis 

Total RNA was extracted from different tissues at various developmental 
stages, using the method of Lindstrom and Vodkin (1991). RNA samples were 
denatured with formaldehyde and separated on 1.2% fbrmaldehyde-agarose gels. 
5 About 5 |ig of total RNA was loaded, and the amount of RNA per lane was 

calibrated by the ethidium bromide-staining intensity of the rRNA bands. Genomic 
DNA was isolated, digested with restriction enzymes and a Southern blot analysis 
was performed according to Sambrook et al. (1989). The TAG1 DNA probe was "P 
labeled by random-priming according to protocols of the manufacture (BRL). 

10 PCR strategy 

Primers used for the amplification of the TAG1 gene were as follows: 
DGAT1 (agacacgaatcccattcccaccga; SEQ ID NO:14), DGAT2 
(agtggtgacaacgcagggatgatg; SEQ ID NO: 15), DGAT3 (atggtcgctcccacattgtgt; 
SEQ ID NO: 16), DGAT4 (catacaatccccatgacatttatca; SEQ ID NO:17). DGAT1 

15 and DGAT2 amplify the 5 1 half of the TAG1 gene and DGAT3 and DGAT4 amplify 
the 3* end of the TAG1 gene. Genomic DNA from AS1 1 was used as template for 
PCR amplification of the mutant TAG1 allele using the thermal profile: 94° C 3 min; 
40 cycles of 94° C 30 seconds, 62° C 45 seconds, 72° C 1 min; and 72° C 15 min. 
To further confirm the mutation, primers A (cgaccgtcggttccagctcatcgg: [SEQ ID 

20 NO:18]) and B (gcggccaatctcgcagcgatcttg; [SEQ ID NO:19]), as well as primers C 
(taaacagtagactcatcatcg; [SEQ ID NO:20]) and B, were used in pairs, respectively, 
to amplify the internal fragment containing the mutation. The primers DGAT1 and 
DGAT4 were used for PCR amplification of the cDNA with an A. thaiiana sHique 
cDNA library as template. Primers A and B were also used in RT- PCR 

25 amplification of the cDNA fragment encompassing the insertion segment 

Construction ofTAGI Multicopy Vector and Transformation and 
Characterization of DGAT Expression in Yeast 

The TAG1 cDNA was cloned into pBluescript SK as described (Hadjeb and 
Berkowitz, 1996). The cDNA was cut out from the vector with Kpnl/Xbal, and 
30 subsquently cloned into the respective sites of the yeast expression vector pYes2 
(Invitrogen). The construct was confirmed by sequencing. Constructs with TAG1 
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transcription under the control of the GAL1 promoter released a fragment of 
approximately 1.9 kb. Because the TAG1 fragment has its own initiating ATG 
codon, the product expressed is not a fusion protein. As a host for yeast expression, 
an SLC deletion strain (YMN5 [sld A2::LEU2 ura3 ]) (kindly provided by M.M. 

5 Nagiec and R.C. Dickson, University of Kentucky, Lexington, KY; Nagiec et al., 
1993) was used; we reasoned that in this mutant, the endogenous DAG pool may 
be lower than in WT yeast, and that this would allow us to maximize the activity 
from over-expressed TAG1 in the presence of exogenously supplied U C-DAG 
during in vitro DGAT assays of transformant ly sates. Yeast transformation was 

10 performed according to Elble (1992). YMN5 transformants containing vector only 
(pYES2) were used as controls. Single colonies were cultured overnight in 20 mL of 
SD medium (Synthetic Dextrose medium with glucose and without uracil, as 
described by Ausubel et al., 1995, Vol. 2, p. 13.1.3) on a rotary shaker (270 rpm) at 
28°C. Cells were pelleted from the overnight culture and resuspended in 50 mL of 

15 medium for induction of expression (SD medium containing galactose and without 
uracil). Cells were reincubated at 28°C, with shaking at 270 rpm, and harvested 
after 4-6 hr. GAL-induced yeast transformants were harvested by centrifugation at 
5000 rpm for 5 min and resuspended in 100 mM Hepes-NaOH, pH 7.4, containing 1 
mM EDTA and 1 mM DTT. Cell lysates were prepared using acid-washed glass 

20 beads as described by Ausubel et al. (1995). Protein in yeast lysates was measured 
using the Bradford (1976) assay, protein levels in each lysate were normalized and 
aliquots (250 \xg protein) were assayed for DGAT activity as described below. 

Lipid Substrates and DGA T Analyses 

14 C-labeled diolein [1- 14 C oleic] (Sp. activity 55 mCi/ mmol) was purchased 
25 from American Radiolabeled Chemicals (St. Louis, MO). The 14 C~labelled sn-1 t 2- 
diolein isomer was purified by TLC on borate-impregnated plates and emulsified in 
Hepes buffer the presence of 0.2% Tween-20 as described by Taylor et al., (1991). 
20:1-CoA, CoASH, ATP, and ail other biochemicals were purchased from Sigma. 

DGAT assays were conducted at pH 7.4, with shaking at 100 rpm in a water 
30 bath at 30°C for 30-60 min. Assay mixtures (0.5 mL final volume) contained lysate 
protein (250 ng), 90 mM Hepes-NaOH, 0.5 mM ATP, 0.5 mM CoASH, 1 mM MgCI 2 , 
200 nM sn-1,2 diolein (sp. activity 2 nC/nmol) in 0.02% Tween 20, and 18 \xM 20:1- 
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CoA as the acyl donor. The 14 Olabeled TAGs were isolated by TLC and quantified 
as described by Taylor et al (1991). 

Further Lipid and Sterol Ester Analyses inAS11 and WT: 

Total lipid extracts (TLEs), and lipid class analyses in WT and the AS11 
5 mutant were performed as described by Taylor et al. , (1991; 1992) and by Katavic 
et a/., (1995). Relative seed oil content was also measured by magic angle sample 
spinning 1 H-NMR, according to the method of Rutar (1989). Analyses were 
conducted with 200-seed samples of intact wild-type and AS1 1 seeds using a 
Bruker AM wide-bore spectrometer (Bruker Analytische Masstechnik GHBH, 
10 Silberstreifen D-76287, Rheinstetten4/Karlstuhe f Germany) operating at 360 MHz. 
To reduce anisotropic line broadening, the seed sample was rotated at 1 kHz in a 
zirconium rotor oriented 54.7° to the magnetic field. The integration response for 
resonances attributable to liquid-like oil were summed and the value for AS1 1 seed 
was recorded relative to the response for the WT control seed sample, the latter set 
15 at a value of 1.00. 

Sterol esters were purified from the TLEs by thin layer chromatography 
(TLC) on Silica H plates developed in hexane: diethyl ether: formic acid (80:20:2, 
v/v/v). After elution from the silica H with chloroform: methanol (2:1, v/v), the sterol 
esters were quantified by saponification followed by methylatfon of the resulting fatty 

20 acids with 3N methanolk>HCI. The fatty acid methyl esters (FAMEs) were analyzed 
by GC as described previously (Taylor et aL t 1991). The free sterols released by 
saponification were also analyzed by GC on a 30 m DB-5 column; GC temperature 
program : initial temp: 180°C, increasing at 10°C/min to 300°C and held at this 
temperature for 15 min. The sterol ester content was reported as a % of the TLE; 

25 i.e. FAMEs released from sterol esters calculated as proportion of the FAMEs 
released by transmethylation of the total lipid extract (TLE). 

Construction of Plant Transformation Vector Containing the Wild-Type DGAT 
gene for Over-Expression in WTA. thaliana and Complementation of the A. 
thaliana AS11 Mutant: 

30 

Two primers: 


WO 00/36114 


PCT/CA99/01202 


29 

Gen 1 (GAGAGGATCCACGCTCACGACCCATTCTTCCCG; (SEQ ID 
NO:21]), and 

Gen 2 (AAGAAGGATCCATCCCCAAAACGGGACCACCAA; [SEQ ID 
NO:22]) 

5 were synthesized according to sequences upstream and downstream of the 

TAG1 gene. These primers were used to PCR amplify a genomic fragment of 5,1 
kb from wild-type A. thaliana. The PCR fragment was purified and digested with 
BamHI and inserted into the corresponding site in plasmid pRD400 (Datla et al. 
1992) to generate the plant transformation vector DGATg-pRD400. 

10 Construction of DGA T cDNA Plant Transformation Vector for Seed-Specific 
Expression: 

The cloned full-length DGAT cDNA was used as a template for PCR 
amplification with the primers DGATXbal {CTAGTCTAGAATGGCGATTTTGGA; SEQ 
ID NO:12) and DGATXhol {GCGCTCGAGTTTCATGACATCGA; SEQ ID NO: 13) to 

15 provide new restriction sites on each end of the sequence. The PCR profile was as 
follows: 94°C 1 min; 30 cycles of 94°C 30 seconds, 55°C 30 seconds, 72°C 1 min; 
and 72°C 5 min. The PCR product was then ligated into the PCR-2.1 vector 
(InVitrogen). A 1.6kb fragment was excised by a Xbal/Kpnl digestion and ligated 
into the corresponding sites of the pSE129 vector (provided by Dr. P. Covello, 

20 PBl/NRC). pSE129A is a vector derived from the plant transformation vector 
pRD400 (Datla et al. 1992). The vector pSE129A contains the seed-specific napin 
promoter and the nos terminator cloned into the EcoRI and Hindlll sites of the 
pRD400 plasmid (See Figure 9). Hence in the DGAT-pSE129A construct, the 
Arabidopsis DGAT cDNA is under the control of the napin promoter. The construct 

25 integrity was confirmed by sequencing. 

Transformation of Agrobactehum with Plant DGAT Vector Constructs: 

Electrocompetent Agrobacterium cells, GV3101 (pMP90) strain, were 
prepared as follows: An Agrobacterium culture was grown 24 to 48 hrs in 2YT, and 
when the absorbance at 600 nm reached 0.5 to 0.7, the ceils were chilled on ice 
30 and pelleted by centrifugation (5,000 x g, 10 min in a GSA rotor at 4°C). The pellet 
was washed in 1 , 0.5 and 0.02 volumes of cold 10% sterile glycerol and 
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resuspended in 0.01 volume of cold 10% glycerol. The electrocompetent cells were 
then frozen in liquid N 2 and stored at -70°C. The Agrobactehum cells were 
transformed by electroporation with 20-50 ng of transforming DNA (either DGATg- 
pRD400 or DGAT-pSE129A) according to the manufacturer's instructions, plated on 
5 a selective medium {LB with 50 ng/mL kanamycin) and incubated overnight at 28°C. 
Single transformed cells were grown overnight (28°C, 225 r.p.m.) in 5 mL LB with 
50 ng/mL Kanamycin and 25 ng/mL Gentamycin. DNA extraction and purification 
were performed with a Qiaprep Spin Miniprep kit (Qiagen). The fidelity of the 
construct was re-checked by DNA sequencing before plant transformation. 

1 0 Transformation of Arabidopsis thallana: 

The transformation protocol was adapted from that described by Clough and 
Bent (1998). Seeds of Arabktopsis thatiana ecotype Columbia and mutant AS11 
(Katavic et al. t 1995) were grown at 22°C under fluorescent illumination (120 ^E-nrr 
2 s* 1 ) in a 16 h light/8 hour dark regime. Typically, four to six plants were raised in a 

15 10 cm 2 pot in moistened Terra-lite Redi-earth (W. R. Grace & Co. Canada Ltd. Ajax, 
ON, Canada). To prevent the soil mix in the pot from falling into the inoculation 
media, soil was mounded as a platform with seeds sown on top, and the whole pot 
covered by a nylon window screen and secured by a rubber band. Plants were 
vacuum infiltrated in an Agrobacterium suspension when the first flowers started 

20 opening. 

To grow Agrobactehum, a 5 mL suspension in LB medium containing 50 
^ig/mL kanamycin and 25 ng/mL gentamycin was cultured overnight at 28°C The 
day before infiltration, this "seed culture" was divided into four flasks containing 250 
mL of LB medium supplemented with 50 ng/mL kanamycin and 25 ng/mL 

25 gentamycin. These culture were grown overnight at 28°C. The next morning after 
the absorbance at 600 nm was checked (approximately = 1 .0), the cells were 
harvested by centrifugation (5,000 x g t 10 min in a GSA rotor at room temperature) 
and resuspended in the infiltration medium (sucrose 5%; Silwet-77 0.005% in water) 
to obtain an optical density at 600 nm of 0.8. The Agrobactehum suspension was 

30 then poured into a beaker and the potted plants inverted into the beaker so that the 
flowers and bolts were submerged. The beaker was then placed into a large Bell jar 
and a vacuum drawn using a vacuum pump, until bubbles formed on the stem 


WO 00/36114 


PCT/CA99/01202 


31 

surfaces and the solution started to bubble slightly, and then the vacuum was 
released rapidly. [Note: The necessary time and pressure will vary from one lab 
setup to the next, but good infiltration is visibly apparent as uniformly darkened, 
water-soaked tissue.] Pots were removed from the beaker, laid on their side in a 
5 plastic tray and covered with a plastic dome, to maintain humidity. The following 
day, the plants were uncovered, set upright and allowed to grow for approximately 
four weeks in a growth chamber under continuous light conditions as described by 
Katavic et al., (1995). When the siliques were mature and dry, seeds were 
harvested and selected for positive transformants. 

1 0 Selection of Putative Transformants (Transgenic plants) and Analysis of 
Transgenic Plants: 

For each construct, seeds were harvested in bulk. Seeds were surface- 
sterilized by submerging them in a solution containing 20% bleach and 0.01% Triton 
X-100 for 20 min, followed by three rinses with sterile water. Sterilized seeds were 

15 then plated by resuspending them in sterile 0.1% phytagar at room temperature 
(about 1 mL phytagar for every 500-1000 seeds), and then applying a volume 
containing 2,000-4,000 seeds onto 150 x 15 mm kanamycin selection plate. Plates 
were incubated for 2 days in the cold without light, and then grown for 7-10 days in 
a controlled environment (22°C under fluorescent illumination (120 nE-m'^s" 1 ) in a 

20 16 h light/8 hour dark regime). The selection media contain !4 MSG medium, 0.8% 
phytagar, 3% sucrose, 50ug/mL kanamycin and 50ug/ mL Timentin. Petri dishes 
and lids were sealed with a Micropore™ surgical tape tape (3M Canada, London, 
ON, Canada). After 7-10 days, drug-resistant plants that had green leaves and well 
established roots within the medium were identified as transformants and at the 3-5 

25 leaf stage, selected transformants were transplanted into flats filled with heavily 
moistened soil mix. Transformants were grown to maturity and mature seeds (T 2 
generation as defined in Katavic et al., (1994)) were harvested from 
indivldualplants, and further analyzed. 

DAW isolation from and analysis of Transformants 

30 Genomic DNA was isolated from individual T t plants following the protocol of 

Dellaporta et a/., (1983). A PCR amplification using the paired primers described 
previously for the DGAT cDNA or for the DGAT gene, was performed to confirm the 
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presence of the cDNA or the gene, respectively, in the T, transforrnants. Southern 
analyses (Southern, 1975) were performed to select the transforrnants containing a 
single copy of the inserted fragment. DNA samples were digested with restriction 
enzymes (Bgl II for the DGAT cDNA and Eco Rl for the DGAT gene), resolved by 

5 electrophoresis on a 1% agarose gel, and Southern blotting performed using a 
nylon filter (Hybond-N+, Amersham) according to Sambrook et al. (1989). The 
DGAT cDNA fragment, labelled with a-^P] dCTP (NEN/DuPont) using the Random 
Primer DNA labelling kit (Gibco BRL), was used as a probe. Hybridization was 
performed at 60°C according to Church and Gilbert (1984). The filter was then 

10 exposed to Kodak X-OMAT-AR film. 

DEPOSIT INFORMATION 

The following biological material has been deposited at the American Type 
Culture Collection (ATCC) of 10801 University Boulevard, Manassas, Virginia, 
201 10-2209, U.S.A. All of these deposits were made on behalf of the 
1 5 Applicant/Assignee (National Research Council of Canada) under the terms of the 
Budapest Treaty on the dates indicated, and have been given the accession 
numbers shown below. 

Deposited Material Date of Deposit Accession No. 

Arabidopsis DGAT gene November 29, 1999 PTA-988 
20 Arabidopsis DGAT cDNA November 29, 1 999 PTA-989 

Arabidopsis ASH seeds December 3, 1999 PTA-1013 

The deposit receipts are shown later in this description. 

25 SEQUENCE LISTING FREE TEXT 

The Sequence Listing provided below contains free text entries in respect of 
SEQ ID NOs: 12 to 22. The free text used in the Sequence Listing is repeated as 
follows: 

SEQ ID NO:12 Primer of DGATXbal 
30 SEQ ID NO:13 Primer of DGATXhol 
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SEQ ID NO:14 


Primer DGAT1 


SEQ ID NO:15 


Primer DGAT2 


SEQ ID NO:16 


Primer DGAT3 


SEQ ID NO:17 


Primer DGAT 4 


5 


SEQ ID NO:18 


Primer A 


SEQ ID NO:19 


Primer B 


SEQ ID NO:20 


Primer C 


SEQ ID NO:21 


Primer Gen 1 


SEQ ID NO:22 


Primer Gen 2. 


10 

A summary of all of the listed sequences is provided below for ease of 
review: 

SEQ ID NO:1 - (pDGAT; vector containing isolated and purified deoxyribonucleic 

acid cDNA; ATCC No PTA-989), Genbank/EMBL Accession No. AJ238008. 

15 SEQ ID NO:2 - The deduced amino acid sequence of the Arabidopsis DGAT 
(AtTAGI) protein. 

SEQ ID NO:3 - (pgenomic DGAT; vector containing isolated and purified genomic 
deoxyribonucleic acid (genomic DNA) ATCC No PTA-988). 

SEQ ID NO:4 - MDGAT, mouse DGAT [GenBank/EMBL Accession No. AF078752 
20 (Cases efa/., 1998)]. 

SEQ ID NO:5 - HARGP1, human ARGP1 protein [GenBank/EMBL Accession No. 
AF059202; Oelkers etal., 1998]. 

SEQ NO:6 - Arabidopsis thaJiana expressed sequence tag (EST) [accession no. 
AA042298). 

25 SEQ ID NO:7 - A diacylglycerol/phorbol ester-binding motif found in SEQ ID NO:2, 
SEQ ID NO:8 and SEQ ID NO:9 ( 4,4 HKWMVRHIYFP* 24 ). 

SEQ ID NO:8 - B. napus DGAT amino acid sequence GenBank EMBL Accession 
No AF1 55224. 
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SEQ ID NO:9 - B. napus DGAT amino add sequence GenBank/EMBL Accession 
No. AF1 64434. 

SEQ ID NO: 10 - Targeting motif typical of members of the SnRK1 protein kinase 
family found in SEQ ID NO:2, SEQ ID NO:8 and SEQ ID NO:9 X-L^-X-K 2 *- 
5 X-X-S^-X-X-X-V 209 

SEQ ID NO:1 1 - A 27 amino acid insertion repeat in SEQ ID NO:2 found in the 
Arabidopsis thaliana AS1 1 mutant 

131 SHAGLFNLCVWLIAVNSRLIIENLMK 157 

SEQ IN NO: 12 - CTAGTCTAGAATGGCGATTTTGGA ( nucleotide sequence of 
10 Primer DGATXbal ). 

SEQ ID NO: 13 - GCGCTCGAGTTTCATGACATCGA ( nucleotide sequence of 
Primer DGATXhol). 

SEQ ID NO:14 - AGACACGAATCCCATTCCCACCGA ( nucleotide sequence of 
Primer DGAT1). 

15 SEQ ID NO:15 - AGTGGTGACAACGCAGGGATGATG ( nucleotide sequence of 
Primer DGAT2). 

SEQ ID NO:16 - ATGGTCGCTCCCACATTGTGT ( nucleotide sequence of Primer 
DGAT3). 

SEQ ID NO:17 - CATACAATCCCCATGACATTTATCA ( nucleotide sequence of 
20 Primer DGAT4). 

SEQ ID NO:18 - CGACXJGTCGGTTCCAGCTCATCGG ( nucleotide sequence of 
Primer A). 

SEQ ID NO:19 - GCGGCGAATCTCGCAGCGATCTTG (nucleotide sequence of 
Primer B). 

25 SEQ ID NO:20 - TAAACAGTAGACTCATCATCG (nucleotide sequence of 
Primer C). 
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SEQ ID NO:21 - GAGAGGATCCACGCTCACGACCCATTCTTCCCG (nucleotide 
sequence of primer Gen 1 ). 

SEQ ID NO;22 - AAGAAGGATCCATCCCCAAAACGGGACCACCAA 

(nucleotide sequence of primer Gen 2). 

5 SEQ ID NO:23 - AS1 1 mutant DGAT cDNA nucleotide sequence. 

SEQ ID NO:24 - AS1 1 mutant DGAT genomic DNA nucleotide sequence. 

SEQ ID NO:25 - the deduced amino acid sequence of SEQ ID NO:23. 
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ATCC 
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BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF 
THE DEPOSIT OF MICROORGANISMS FOR THE PURPOSES OP PATENT PROCEDURE 


INTERNATIONAL FORM 

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7 J 
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 102 


lot (tome and Address of Depositor or Attorney) 

NEC Canada 
Atnr David C Taylor 
Plant B iotechnolo gy Instftutc 
HOOynnasiumPbce 
Saskatoon, SK S7N 0W9 


RfcCEIVLb 

KEGB 

DEC 1 0 1999 


EKTEREd. 


CHECKED. 


Deposited on Behalf of: National Research Comal of Canada 

Identification Reference by Depositor: Patent Deposit Designation 

Ambhbps'u DGAT&M PTA-9S8 
ArabiJopilsDQAJdSHA PTA-989 
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deposits wcie received November 29. 1 999 bv thia fotematinnil Pepoatmy Ariharity and have been accented. 
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assise, whichever is longer. The United States and nw^crtha countries nieri 
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ATCC 


lOSOIUmvent^BM • MiWDt, Va 201 10-220) • TWefhcme: 70V365-J700 • PAX TO- 

BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF 
THE DEPOSIT OF MICROORGANISMS FOUTOB PURPOSES OF PATENT PROCEDURE 

INTERNATIONAL FORM 

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7.3 
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10.2 

To: (Name mnd Address of Depositor or Attorney) 

Nations! Research Council of Canada 

Ann: David C.Taylor 

P lint Biotechnology Institute 

110 Gymnasium Piece 
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CLAIMS: 

1. Isolated and purified deoxyribonucleic acid (DNA), characterized in that 
said DNA includes a sequence according to SEQ ID NO:1 or a part of SEQ ID NO:1, or 
a sequence that is substantially homologous to SEQ ID NO:1. 

2. Isolated and purified deoxyribonucleic acid (DNA), characterized in that 
said DNA includes a sequence according to SEQ ID NO:3 or a part of SEQ ID NO:3 f or 
a sequence that is substantially homologous to SEQ ID NO:3. 

3. A vector for transformation of plant cells, characterized in that said 
vector contains a deoxyribonucleic acid sequence according to SEQ ID NO:1, or a part 
of SEQ ID NO:1 , or a sequence that is substantially homologous to SEQ ID NO:1 . 

4. A vector for transformation of plant cells, characterized in that said 
vector contains a deoxyribonucleic acid sequence according to SEQ ID NO:3, or a part 
of SEQ ID NO:3, or a sequence that is substantially homologous to SEQ ID NO:3. 

5. A vector for transformation of plant cells, characterized in that said 
vector contains a deoxyribonucleic acid sequence according to SEQ ID NO:23, which 
is SEQ ID NO: 1 altered to contain an 81 bp insertion, such that the deduced amino 
acid sequence of the encoded protein contains the repeated sequence 
SHAGLFNLCNAA/LIAVNSRLIIENLMK according to SED ID NO:25, where the spacing 
and identity of the underlined amino acids are identical or are replaced by conserved 
substitutions. 

6. A vector according to claim 3 or claim 5, characterized in that said 
sequence is present in said vector in a sense orientation, 

7. A vector according to claim 3, characterized in that said sequence is 
present in said vector in an anti-sense orientation. 

8. Plasmid pDGATcDNA (ATCC PTA-989). 

9. Plasmid pDGATgene (ATCC PTA-988). 
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10. A plant having a genome, characterized in that the genome contains an 
introduced nucleotide sequence of SEQ ID NO:1 , or a part of SEQ ID NO:1, or a 
sequence that is substantially homologous to SEQ ID NO:1. 

11. A plant seed having a genome, characterized in that said genome 
contains an introduced nucleotide sequence of SEQ ID NO:1 , or a part of SEQ ID 
NO:1. or a sequence that is substantially homologous to SEQ ID NO:1. 

12. A genetically transformed plant, characterized in that said genome has 
been transformed by a vector according to claim 3 or claim 4 or claim 5. 

13. A genetically transformed plant seed, characterized in that said seed 
has been transformed by a vector according to claim 3 or claim 4 or claim 5. 

14. A plant seed as claimed in Claim 1 1 or Claiim 13 t characterized by 
exhibiting an altered seed oil content compared to an average of a statistically- 
significant number of seeds of genomically-unmodified plants of the same genotype 
grown in identical conditions at the same time. 

15. A plant seed as claimed in Claim 1 1 or Claim13, characterized by 
exhibiting an altered diacylglycerol content in its seed oil compared to an average of a 
statistically-significant number of seeds of genomically-unmodified plants of the same 
genotype grown in identical conditions at the same time. 

16. A plant seed as claimed in 1 1 or 13, characterized by exhibiting a seed 
oil with an altered fatty acyl composition compared to an average of a statistically- 
significant number of seeds a genomically-unmodified plantof the same genotype 
grown in identical conditions at the same time. 

17. A plant as claimed in 10 or 12, characterized by exhibiting an enhanced 
biomass compared to an average of a statistically-significant number of genomically- 
unmodified plants of the same genotype grown in identical conditions at the same time. 

18. A seed as claimed in 1 1 or 1 3, characterized by exhibiting an enhanced 
biomass compared to an average of a statistically-significant number of seeds of 
genomically-unmodified plants of the same genotype grown under identical conditions 
at the same time. 
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19. A method of producing transgenic plants by introducing a nucleotide 
sequence into a genome of said plant, characterized in that said nucleotide sequence 
introduced into said genome includes SEQ ID NO:1 or SEQ ID NO:3, or a part of SEQ 
ID NO:1 or SEQ ID NO:3, or a sequence that is substantially homologous to SEQ ID 
NO:1 , or to SEQ ID NO:3, or to or SEQ ID NO: 1 . or a part of SEQ ID NO:1 or SEQ ID 
NO:3, or a sequence that is substantially homologous to SEQ ID NO:1 or SEQ ID 
NO:3 or SEQ ID NO;1 containing an 81 bp insertion [SEQ ID NO:23] such that the 
deduced amino acid sequence of the encoded protein contains the repeated sequence 
SHAG LFN LCVWLIAVN SRLI IENLMK according to SED ID NO.25, where the spacing 
and identity of the underlined amino acids are identical or are replaced by conserved 
substitutions. 

20. A method according to daim 19, characterized in that said plant is a 
member of the Brassicaceae. 

21: A method according to claim 19, characterized in that said plant is 
selected from the group consisting of Arabidopsis thaliana, borage (Borago spp.), 
Canola, castor {Ricinus communis), cocoa bean {Theobroma cacao), corn (Zea mays), 
cotton (Gossypium spp), Crambe spp., Cuphea spp., flax (Linum spp.), Lesquerella 
and Limnanthes spp., Linoia, nasturtium (Tropaeolum spp.), Oenothera spp., olive 
(O/ea spp.), palm (Baeis spp.), peanut (Arachis spp.), rapeseed, safflower (Carthamus 
spp.), soybean (Glycine and Soya spp.) ( sunflower (Helianthus spp.), tobacco 
[Nicotians spp.), Vernonia spp., wheat (Triticum spp.), barley (Hordeum spp.), rice 
(Oryza spp.), oat (Avena spp.) sorghum (Sorghum spp ), rye (Secale spp.) and other 
members of the Gramineae. 

22. A plant DNA sequence or part thereof, characterized in that the 
sequence is substantially homologous to at least a part of SEQ ID NO:1 or SEQ ID 
NO: 3, and in that said sequence has been isolated, characterized or designed using 
sequence information from SEQ ID NO:1 or SEQ ID NO:3, or SEQ ID NO:1 containing 
an 81 bp insertion [SEQ ID NO:23] such that the deduced amino acid sequence of the 
encoded protein contains the repeated sequence 

SHAGLFNLCVWLIAVNSRLIIENLMK according to SED ID NO:25, where the spacing 
and identity of the underlined amino acids are identical or are replaced by conserved 
substitutions. 
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23. A method of changing the oil content, acyl composition or diacyiglycerol/ 
triacylglycerol proportions of the seed oil of plant seeds by introducing a sense or anti- 
sense nucleic acid construct into a plant transformation vector, using the vector to 
transform the genome of a plant or plant seed, and then growing the plant or plant 
seed and extracting the oil from the plant seed, characterized in that said nucleic acid 
sequence is SEQ ID NO:1 or SEQ ID NO:3, or a part of SEQ ID NO:1 or SEQ ID NO:3, 
or a sequence that is substantially homologous to SEQ ID NO:1 or SEQ ID NO:3 or 
SEQ ID NO:1 [SEQ ID NO:23] containing an 81 bp insertion such that the deduced 
amino acid sequence of the encoded protein contains the repeated sequence 
SHAGLFNLCVWU AVNSRLl I ENL MK according to SED ID NO:25, where the spacing 
and identity of the underlined amino acids are identical or are replaced by conserved 
substitutions. 
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SEQUENCE LISTING 

<110> National Research Council of Canada 
Zou, Jitao 
Taylor, David C 
Wei , Yangdou 
Jako, Colette C 

<120> Diacylglycerol Acyl Transferase Gene from Plants 

<130> 43922pt 

<140> PCT/CA 
<141> 1999-12-16 

<150> 60/112,812 
<151> 1998-12-17 

<160> 25 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1904 
<212> DMA 

<213> Arabidopsis thaliana 
<400> 1 

atttcttagc ttcttccttc aatccgctct ttccctctcc attagattct gtttcctctt 60 
tcaatttctt ctgcatgctt ctcgattctc tctgacgcct cttttctccc gacgctgttt 120 
cgtcaaacgc ttttcgaaat ggcgattttg gattctgctg gcgttactac ggtgacggag 180 
aacggtggcg gagagttcgt cgatcttgat aggcttcgtc gacggaaatc gagatcggat 240 
tcttctaacg gacttcttct ctctggttcc gataataatt ctccttcgga tgatgttgga 300 
gctcccgccg acgttaggga tcggattgat tccgttgtta acgatgacgc tcagggaaca 360 
gccaatttgg ccggagataa taacggtggt ggcgataata acggtggtgg aagaggcggc 420 
ggagaaggaa gaggaaacgc cgatgctacg tttacgtatc gaccgtcggt tccagctcat 480 
cggagggcga gagagagtcc acttagctcc gacgcaatct tcaaacagag ccatgccgga 540 
ttattcaacc tctgtgtagt agttcttatt gctgtaaaca gtagactcat catcgaaaat 600 
cttatgaagt atggttggtt gatcagaacg gatttctggt ttagttcaag atcgctgcga 660 
gattggccgc ttttcatgtg ttgtatatcc ctttcgatct ttcctttggc tgcctttacg 720 
gttgagaaat tggtacttca gaaatacata tcagaacctg ttgtcatctt tcttcatatt 780 
attatcacca tgacagaggt tttgtatcca gtttacgtca ccctaaggtg tgattctgct 840 
tttttatcag gtgtcacttt gatgctcctc acttgcattg tgtggctaaa gttggtttct 900 
tatgctcata ctagctatga cataagatcc ctagccaatg cagctgataa ggccaatcct 960 
gaagtctcct actacgttag cttgaagagc ttggcatatt tcatggtcgc tcccacattg 1020 
tgttatcagc caagttatcc acgttctgca tgtatacgga agggttgggt ggctcgtcaa 1080 
tttgcaaaac tggtcatatt caccggattc atgggattta taatagaaca atatataaat 1140 
cctattgtca ggaactcaaa gcatcctttg aaaggcgatc ttctatatgc tattgaaaga 1200 
gtgttgaagc tttcagttcc aaatttatat gtgtggctct gcatgttcta ctgcttcttc 1260 
cacctttggt taaacatatt ggcagagctt ctctgcttcg gggatcgtga attctacaaa 1320 
gattggtgga atgcaaaaag tgtgggagat tactggagaa tgtggaatat gcctgttcat 1380 
aaatggatgg ttcgacatat atacttcccg tgcttgcgca gcaagatacc aaagacactc 1440 
gccattatca ttgctttcct agtctctgca gtctttcatg agctatgcat cgcagttcct 1500 
tgtcgtctct tcaagctatg ggcttttctt gggattatgt ttcaggtgcc tttggtcttc 1560 
atcacaaact atctacagga aaggtttggc tcaacggtgg ggaacatgat cttctggttc 1620 
atcttctgca ttttcggaca accgatgtgt gtgcttcttt attaccacga cctgatgaac 1680 
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gcctcgttgg atctccgttg atgttgtggt 
ataaccattg aagaagaaaa gaaaattaga 
acacgcaaac ccgtttggat tttgttatgg 


2/20 

tcaaaaaatg actttcttca aacatctatg 1740 

ggttctgatg ctaaaacgac aaatagtgtt 1800 

gttgttgtat ctgcaaaaat tttggtagag 1860 

tgtaaagcgg ccgc 1904 


<210> 2 
<211> 520 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ala lie Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly 
15 10 15 

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 
20 25 30 

Ser Ast> Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 
35 40 45 

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg lie Asp 
50 55 60 

Ser Val Val Asn Asp Asp Ala Gin Gly Thr Ala Asn Leu Ala Gly Asp 
65 70 75 80 

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 
85 90 95 

Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 
100 105 110 

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala lie Phe 
115 120 125 

Lys Gin Ser His Ala Gly Leu Phe Asn Leu Cys Val Vai Val Leu lie 
130 135 140 

Ala Val Asn Ser Arg Leu He He Glu Asn Leu Met Lys Tyr Gly Trp 
145 150 155 160 

Leu He Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp 
165 170 175 

Pro Leu Phe Met Cys Cys He Ser Leu Ser He Phe Pro Leu Ala Ala 
180 185 190 

Phe Thr Val Glu Lys Leu Val Leu Gin Lys Tyr He Ser Glu Pro Val 
195 200 205 

Val He Phe Leu His He He He Thr Met Thr Glu Val Leu Tyr Pro 
210 215 220 

Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr 
225 230 235 240 
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Leu Met Leu Leu Thr Cys lie Val Trp Leu Lys Leu Val Ser Tyr Ala 
245 250 255 

His Thr Ser Tyr Asp lie Arg Ser Leu Ala Asn Ala Ala Asp Lys Ala 
260 265 270 

Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe 
275 280 285 

Met Val Ala Pro Thr Leu Cys Tyr Gin Pro Ser Tyr Pro Arg Ser Ala 
290 295 300 

Cys lie Arg Lys Gly Trp Val Ala Arg Gin Phe Ala Lys Leu Val lie 
305 310 315 320 

Phe Thr Gly Phe Met Gly Phe lie lie Glu Gin Tyr He Asn Pro lie 
325 330 335 

Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala He 
340 345 350 

Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys 
355 360 365 

Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn He Leu Ala Glu Leu 
370 375 380 

Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys 
385 390 395 400 

Ser Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp 
405 410 415 

Met Val Arg His He Tyr Phe Pro Cys Leu Arg Ser Lys He Pro Lys 
420 425 430 

Thr Leu Ala He He He Ala Phe Leu Val Ser Ala Val Phe His Glu 
435 440 445 

Leu Cys He Ala Val Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu 
450 455 460 

Gly He Met Phe Gin Val Pro Leu Val Phe lie Thr Asn Tyr Leu Gin 
465 470 475 480 

Glu Arg Phe Gly Ser Thr Val Gly Asn Met He Phe Trp Phe He Phe 
485 490 495 

Cys He Phe Gly Gin Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu 
500 505 510 

Met Asn Arg Lys Gly Ser Met Ser 
515 520 
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<210> 3 
<211> 5193 
<212> DNA 

<213> Arabidopsis thaliana 


<400> 3 

gctcacgacc cattcttccc gttccatttg gttttattta tttcaaagtt taatattcct 60 
tttgtataac attcaaatcrt tcacatgatt gattgtgtga aaaccccaca gattttacta 120 
caataggggg agttgactta aaatagctat tgatgtcgaa aaaatgtatt ttagttataa 180 
attatactaa agaaaatttt tgatttgtct gttgtttaag catatgtatt gttaaactta 240 
aaaaaatatg tattgttaat cttaaaaatg taggagtaca catcaaatac tcgagcataa 300 
tcaaaaccgt attcatagac cgatgtgaga atcaaataga agataatgtg attttttaaa 360 
atatcgtatc tccaaatcaa tcacttagaa gataatgtaa ttctttatgt gctacataaa 420 
taaatatata tatatatata tatatatatc ttgtatatat gtcttgacaa aaaattgcca 480 
gtcaaaaacc atgactgaat caaactataa gtcggattga atcaaactat aagtcggatg 540 
agtattaatt tccattatgt ttctatactt tacaaaccgg aaaatagata ttatagatac 600 
caaaaaagta gatttgtgta tattattaga agatttggaa tttcatcatt atcaggatct 660 
aaagtacttc cctaattaaa tcatgtcggt tgaaaaagct caatgaatgt ttgaaatttg 720 
gaaagtttat taaattcgga tctttttttt ttgtttgtcg tcccaaacat ttttatttta 780 
ttacaaataa ccaacttatc cttactacta aatcacttca tatctttgat accaacaaat 840 
catttcatat tctattttga tgtttaagaa aacactattt accagttaca aaatattata 900 
aggattgttg tttagaaaaa aaagtacaag ttgaattctt tttgtcaaat ataaaattga 960 
ctttttaata tataattgac ttattgaaca tgattacaga attaatcatc tacaaaactt 1020 
tccaagttta taataaatac atttcaaaga ctattagttc ttcttaaaat atttctaaaa 1080 
gtgatcaaag actaccacat ataattcaga aaaagtagaa gttgatttct ttttgtcaaa 1140 
taaataattg acttaaaata gtttggaaag ccattgaact tgattataga attgataatg 1200 
tacataaaaa aattccaagt ttataataaa tacatttttc aaatgctata tcagttcttc 1260 
ttaaaatatt tcactaaaaa aacactcaaa tatagaataa atttattgaa taacatacca 1320 
actgtaaaac agaatttgac aaaaaaaaaa aaaaaatgaa atgaagatga agacaaaaat 1380 
aaatcaccag aggatcttat gcaaaaaaat atatgaatac acaataaacc atattgatat 1440 
ttttaaaata aaataaaaac agaaaaatat cccaacaccg cttttcaatt aaaaatcttc 1500 
cgtcaccatt gttgtcatct tcctctctcg tgaatccttt ttcctttctt cttcttcttc 1560 
tcttcagaga aaactttgcc tctctttcta taaggaacca gacacgaatc ccattcccac 1620 
cgatttctta gcttcttcct tcaatccgct ctttccctct ccattagatt ctgtttcctc 1680 
tttcaatttc ttctgcatgc rtctcgattc tctctgacgc ctcttttcrc ccgacgctgt 1740 
ttcgtcaaac gcttttcgaa acggcgattt tggactctgc tggcgtxact acggtgacgg 1800 
agaacggtgg cggagagttc gtcgatcttg ataggcttcg tcgacggaaa tcgagaccgg 1860 
attcttctaa cggacttctt ctctctggtt ccgacaataa ttctccttcg gatgatgttg 1920 
gagctcccgc cgacgttagg gatcggattg attccgttgt taacgatgac gctcagggaa 1980 
cagccaattt ggccggagat aataacggtg gtggcgataa taacggtggt ggaagaggcg 2040 
gcggagaagg aagaggaaac gccgatgcta cgtttacgta tcgaccgtcg gttccagctc 2100 
atcggagggc gagagagagt ccacttagct ccgacgcaat cttcaaacag gtttaaaatc 2160 
tcagaaatct tcgaatttgg tgtttgcttg ttgttttata tggaattgag tttggtgatt 2220 
gttttgcatt gcagagccat gccggattat tcaacctctg tgtagtagtt cttattgctg 2280 
taaacagtag actcatcatc gaaaatctta tgaaggtttg ctgttacttg tttctccttt 2340 
taggaattga attgcttgaa aatttatcag agacgaataa ctttgttgtt gctatcattc 2400 
atgtagtatg gttggttgat cagaacggat ttctggttta gttcaagatc gctgcgagat 2460 
tggccgcttt tcatgtgttg gtaaaagaag atgtttttta tttccagcaa tgttacattg 2520 
ttatacgtat aatgatgagt ttagtgatca agttcctctt tgattcttct ttcttgttgc 2580 
agtatatccc tttcgatctt tcctttggct gcctttacgg ttgagaaatt ggtacttcag 2640 
aaatacatat cagaacctgt gagtaattac tattctccag ccattactgt aatttttatt 2700 
gaagacaagt ttgtatcatg aagaacttac aagttctgtt ttgaaaatgc tcaaggttgt 2760 
catctttctt catattatta tcaccatgac agaggttttg tatccagttt acgtcaccct 2820 
aaggtgatac tgtttttctg gtctcagttt gtgatactgt ttttaagttt agttgtctga 2880 
cccggtgatc ttgaaaatgg acaggtgtga ttctgctttt ttatcaggtg tcactttgat 2940 
gctcctcact tgcattgtgt ggctaaagtt ggtttcttat gctcatacta gctatgacat 3000 
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aagatcccta gccaatgcag ctgataaggt aaaatacgaa aaagaagcgt atgtattagt 3060 
cacttgcact gtgttactgt tttaaccaaa cactgttatg aactttaggc caatcctgaa 3120 
gtctcctact acgttagctt gaagagcttg gcatatttca tggtcgctcc cacattgtgt 3180 
tatcaggtaa ctgcaaagtg catcaaccat tcttatactt gcaagagttt cttgtctaaa 3240 
cctcggatct ttgcttttcc ccagccaagt tatccacgtt ctgcatgtat acggaagggt 3300 
tgggtggctc gtcaatttgc aaaactggtc atattcaccg gattcatggg atttataata 3360 
gaacaagtac gttttcacat cttgctttat tagttttcct tggtgaaaat catcatccct 3420 
gcgttgtcac cacttgactt catgttcttt tgttacattt tggcagtata taaatcctat 3480 
tgtcaggaac tcaaagcatc ctttgaaagg cgatcttcta tatgctattg aaagagtgtt 3540 
gaagctttca gttccaaatt tatatgtgtg gctctgcatg ttctactgct tcttccacct 3600 
ttggtatgct gtgatcccat ctctttcaaa ataatttgca aattcgaaaa accgaaaaag 3660 
gctaaatctc atacgaattt gatattttta gtttcttaga gtcggtgatg taatttcagt 3720 
tactgaacgc aaatctcttg tccaaaggtt aaacatattg gcagagcttc tctgcttcgg 3780 
ggatcgtgaa ttctacaaag attggtggaa tgcaaaaagt gtgggagatg tgagctattt 3840 
tactcaaaag aaaacttatg atttttaatg ttgtcgttgt ttttgggtca tctaactaac 3900 
caaattcatg tattcactgt cttcctttat cagtactgga gaatgtggaa tatggtatgg 3960 
ttctcttcct aaacatcacc ttcttttgta cacaaaatag aagaagagag ctaattaaga 4.020 
tcttgttttc cttgacagcc tgttcataaa tggatggttc gacatatata cttcccgtgc 4080 
ttgcgcagca agataccaaa ggtgagtgag atatataccg atatgcaatt gtcgagattt 4140 
gtttccgtga tataaattta accctccaca cacttgtttt tcagacactc gccattatca 4200 
ttgctttcct agtctctgca gtctttcatg aggtatacat actttctaca ttgccctgtc 4260 
tctagacgca tgaacacacg ctagtgaaag aaatgctaat attcaaagca ttgtttttac 4320 
ttaacgatct tgtgttacaa atttcctttt gacagctatg catcgcagtt ccttgtcgtc 4380 
tcttcaagct atgggctttt cttgggatta tgtttcaggt taaaaaatta ctaaactgct 4440 
gcagtcgatt tttactaaac tctaatctca tattctgacc aaccaatttg tttgagtagg 4500 
tgcctttggt cttcatcaca aactatctac aggaaaggtt tggctcaacg gtatgctctc 4560 
aaaacccgag aaaatagaac gaataactct ttctttcata gcctagccat ttaaatcgca 4620 
atgctgaaac ttaataataa aggtgatctg ttttggaatg ggatcatatt attaggtggg 4680 
gaacatgatc ttctggttca tcttctgcat tttcggacaa ccgatgtgtg tgcttcttta 4740 
ttaccacgac ctgatgaacc gaaaaggatc gatgtcatga aacaactgtt caaaaaatga 4800 
ctttcttcaa acatctatgg cctcgttgga tctccgttga tgttgtggtg gttctgatgc 4860 
taaaacgaca aatagtgtta taaccattga agaagaaaag aaaattagag ttgttgtatc 4920 
tgcaaaaatt ttggtagaga cacgcgaacc cgtttggatt ttgttatggt gtaaagaaat 4980 
ttcaatcaaa aaactgttgt aataattgtt accaaaaaga aatgcttttc tggaaacgag 5040 
gggaaaaata gtagttttgt taggttttac tgtttggacc aaatctagta aaaaactttt 5100 
tgtaataagg aaaaaaaaag aacaaatgtg ataaatgcat ggggattgta tgaaaccttc 5160 
caataaagtt gattggtggt cccgttttgg gga 5193 


<210> 4 
<211> 498 
<212> PRT 
<213> mouse 

<400> 4 

Met Gly Asp Arg Gly Gly Ala Gly Ser Ser Arg Arg Arg Arg Thr Gly 
15 10 15 

Ser Arg Val Ser Val Gin Gly Gly Ser Gly Pro Lys Val Glu Glu Asp 
20 25 30 

Glu Val Arg Asp Ala Ala Val Ser Pro Asp Leu Gly Ala Gly Gly Asp 
35 40 45 

Ala Pro Ala Pro Ala Pro Ala Pro Ala His Thr Arg Asp Lys Asp Gly 
50 55 60 


SUBSTITUTE SHEET (RUl£ 26) 


WO 00/36114 


PCT/CA99/01202 


6/20 

Arg Thr Ser Val Gly Asp Gly Tyr Trp Asp Leu Arg Cys His Arg Leu 
65 70 75 80 

Gin Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser Asn Tyr Arg Gly 
85 90 95 

He Leu Asn Trp Cys Val Val Met Leu He Leu Ser Asn Ala Arg Leu 
100 105 110 

Phe Leu Glu Asn Leu He Lys Tyr Gly He Leu Val Asp Pro He Gin 
115 120 125 

Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp Pro Ala Pro Cys 
130 135 140 

Val lie He Ala Ser Asn He Phe Val Val Ala Ala Phe Gin He Glu 
145 150 155 160 

Lys Arg Leu Ala Val Gly Ala Leu Thr Glu Gin Met Gly Leu Leu Leu 
165 170 175 

His Val Val Asn Leu Ala Thr He He Cys Phe Pro Ala Ala Val Ala 
180 185 190 

Leu Leu Val Glu Ser He Thr Pro Val Gly Ser Val Phe Ala Leu Ala 
195 200 205 

Ser Tyr Ser He Met Phe Leu Lys Leu Tyr Ser Tyr Arg Asp Val Asn 
210 215 220 

Leu Trp Cys Arg Gin Arg Arg Val Lys Ala Lys Ala Val Ser Thr Gly 
225 230 235 240 

Lys Lys Val Ser Gly Ala Ala Ala Gin Gin Ala Val Ser Tyr Pro Asp 
245 250 255 

Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe He Phe Ala Pro Thr Leu 
260 265 270 

Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg He Arg Lys Arg Phe 
275 280 285 

Leu Leu Arg Arg Val Leu Glu Met Leu Phe Phe Thr Gin Leu Gin Val 
290 295 300 

Gly Leu He Gin Gin Trp Met Val Pro Thr He Gin Asn Ser Met Lys 
305 310 315 320 

Pro Phe Lys Asp Met Asp Tyr Ser Arg He He Glu Arg Leu Leu Lys 
325 330 335 

Leu Ala Val Pro Asn His Leu He Trp Leu He Phe Phe Tyr Trp Phe 
340 345 350 

Phe His Ser Cys Leu Asn Ala Val Ala Glu Leu Leu Gin Phe Gly Asp 
355 360 365 
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Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ala Glu Ser Val Thr Tyr Phe 
370 375 380 

Trp Gin Asn Trp Asn lie Pro Val His Lys Trp Cys lie Arg His Phe 
385 390 395 400 

Tyr Lys Pro Met Leu Arg His Gly Ser Ser Lys Trp Val Ala Arg Thr 
405 410 415 

Gly Val Phe Leu Thr Ser Ala Phe Phe His Glu Tyr Leu Val Ser Val 
420 425 430 

Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Ala Met Met Ala Gin 
435 440 445 

Val Pro Leu Ala Trp He Val Gly Arg Phe Phe Gin Gly Asn Tyr Gly 
450 455 460 

Asn Ala Ala Val Trp Val Thr Leu He He Gly Gin Pro Val Ala Val 
465 470 475 480 

Leu Met Tyr Val His Asp Tyr Tyr Val Leu Asn Tyr Asp Ala Pro Val 
485 490 495 

Gly Val 


<210> 5 
<211> 488 
<212> PRT 
<213> human 

<400> 5 

Met Gly Asp Arg Gly Ser Ser Arg Arg Arg Arg Thr Gly Ser Arg Pro 
15 10 15 

Ser Ser His Gly Gly Gly Gly Pro Ala Ala Ala Glu Glu Glu Val Arg 
20 25 30 

Asp Ala Ala Ala Gly Pro Asp Val Gly Ala Ala Gly Asp Ala Pro Ala 
35 40 45 

Pro Ala Pro Asn Lys Asp Gly Asp Ala Gly Val Gly Ser Gly His Trp 
50 55 60 

Glu Leu Arg Cys His Arg Leu Gin Asp Ser Leu Phe Ser Ser Asp Ser 
65 70 75 80 

Gly Phe Ser Asn Tyr Arg Gly He Leu Asn Trp Cys Val Val Met Leu 
85 90 95 

He Leu Ser Asn Ala Arg Leu Phe Leu Glu Asn Leu lie Lys Tyr Gly 
100 105 110 
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He Leu Val Asp Pro He Gin Val Val Ser Leu Phe Leu Lys Asp Pro 
115 120 125 

His Ser Trp Pro Ala Pro Cys Leu Val He Ala Ala Asn Val Phe Ala 
130 135 140 

Val Ala Ala Phe Gin Val Glu Lys Arg Leu Ala Val Gly Ala Leu Thr 
145 150 155 160 

Glu Gin Ala Gly Leu Leu Leu His Val Ala Asn Leu Ala Thr He Leu 
165 170 175 

Cys Phe Pro Ala Ala Val Val Leu Leu Val Glu Ser He Thr Pro Val 
180 185 190 

Gly Ser Leu Leu Ala Leu Met Ala His Thr He Leu Phe Leu Lys Leu 
195 200 205 

Phe Ser Tvr Arg Asp Val Asn Ser Trp Cvs Arg Arg Ala Arg Ala Lys 
210 215 220 

Ala Ala Ser Ala Gly Lys Lys Ala Ser Ser Ala Ala Ala Pro His Thr 
225 230 235 240 

Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe Leu 
245 250 255 

Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg 
260 265 270 

He Arg Lys Arg Phe Leu Leu Arg Arg He Leu Glu Met Leu Phe Phe 
275 280 285 

Thr Gin Leu Gin Val Gly Leu lie Gin Gin Trp Met Val Pro Thr He 
290 295 300 

Gin Asn Ser Met Lys Pro Phe Lys Asp Met Asp Tyr Ser Arg lie He 
305 310 315 320 

Glu Arg Leu Leu Lys Leu Ala Val Pro Asn His Leu He Trp Leu lie 
325 330 335 

Phe Phe Tyr Trp Leu Phe' His Ser Cys Leu Asn Ala Val Ala Glu Leu 
340 345 350 

Met Gin Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu 
355 360 365 

Ser Val Thr Tyr Phe Trp Gin Asn Trp Asn He Pro Val His Lys Trp 
370 375 380 

Cys lie Arg His Phe Tyr Lys Pro Met Leu Arg Arg Gly Ser Ser Lys 
385 390 395 400 

Trp Met Ala Arg Thr Gly Val Phe Leu Ala Ser Ala Phe Phe His Glu 
405 410 415 


SUBSTITUTE SHffT (RULE 26) 


WO 00/361 14 


PCT/CA99/01202 


Tyr Leu Val Ser Val Pro Leu Arg 
420 

Gly Met Met Ala Gin lie Pro Leu 
435 440 

Gin Gly Asn Tyr Gly Asn Ala Ala 
450 455 

Gin Pro lie Ala Val Leu Met Tyr 
465 470 

Tyr Glu Ala Pro Ala Ala Glu Ala 
485 


9/20 

Met Phe Arg Leu Trp Ala Phe Thr 
425 430 

Ala Trp Phe Val Gly Arg Phe Phe 
445 

Val Trp Leu Ser Leu He He Gly 
460 

Val His Asp Tyr Tyr Val Leu Asn 
475 480 


<210> 6 
<211> 629 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 6 

tgcatgtata cggaagggtt gggtggctcg 
attcatggga tttataatag aacaatatat 
tttgaaaggc gatcttctat atgctattga 
atatgtgtgg ctctgcatgt tctactgctt 
gcttctctgc ttcggggatc gtgaattcta 
agattactgg gagaatgtgg aatatgcctg 
ccttccccgt gcttgcgcac aaggattacc 
ttcccaagcc ccctggaggc ctttccatgg 
ttcaaagcaa agggggnttn cctggggnta 
aannttcccc cgggaaaggg ttgcccaccg 
aattttggga acccgggggg ggccttttt 


tcaatttgca aaactggtca tattcaccgg 60 
aaatcctatt gtcaggaact caaagcatcc 120 
aagagtgttg aagctttcag ttccaaattt 180 
cttccacctt tggttaaaca tattggcaga 240 
caaagattgg tggaatgcaa aaagtgtggg 300 
tccataaatg ggatgggtcc gacatatata 360 
caaagacacc ccggccatta accattggct 420 
gccanggacc cggngtnccc tggcnggccc 480 
aagntccang ggcccttggg gcccanccaa 540 
gggggngaaa aanncccggg ggcaccncgg 600 

629 


<210> 7 
<211> 11 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 7 

His Lys Trp Met Val Arg His He Tyr Phe Pro 
15 10 


<210> 8 
<211> 341 
<212> PRT 

<213> Brassica napus 
<400> 8 

Met Cys Cys Leu Ser Leu Ser He Phe Pro Leu Ala Ala Phe Thr Val 
15 10 15 
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Glu Lys Leu Val Leu Gin Lys Cys lie Ser Glu Pro Val Val lie Phe 
20 25 30 

Leu His Val He lie Thr Met Thr Glu Val Leu Tyr Pro Val Tyr Val 
35 40 45 

Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Asp Thr Leu Met Leu 
50 55 60 

Leu Thr Cys He Val Trp Leu Lys Leu Val Ser Tyr Ala His Thr Asn 
65 70 75 80 

Tyr Asp He Arg Thr Leu Ala Asn Ser Ser Asp Lys Ala Asn Pro Glu 
85 90 95 

Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met Leu Ala 
100 105 110 

Pro Thr Leu Cys Tyr Gin Pro Ser Tyr Pro Arg Ser Pro Cys He Arg 
115 120 125 

Lys Gly Trp Val Ala Arg Gin Phe Ala Lys Leu Val He Phe Thr Gly 
130 135 140 

Leu Met Gly Phe He He Glu Gin Tyr He Asn Pro He Val Arg Asn 
145 150 155 160 

Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala He Glu Arg Val 
165 170 175 

Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr 
180 185 190 

Cys Phe Phe His Leu Trp Leu Asn He Leu Ala Glu Leu Leu Cys Phe 
195 200 205 

Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser Val Gly 
210 215 220 

Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg 
225 230 235 240 

His Val Tyr Phe Pro Cys Leu Arg He Lys He Pro Lys Val Pro Ala 
245 250 255 

He He He Ala Phe Leu Val Ser Ala Val Phe His Glu Leu Cys He 
260 265 270 

Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly He Met 
275 280 285 

Phe Gin Val Pro Leu Val Phe He Thr Asn Phe Leu Gin Glu Arg Phe 
290 295 300 


Gly Ser Met Val Gly Asn Met He Phe Gly Ser Ala Ser Cys He Phe 
305 310 315 320 
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Gly Gin Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met Asn Arg 
325 330 335 

Lys Gly Ser Met Ser 
340 


<210> 9 
<211> 503 
<212> PRT 

<213> Brassica napus 
<400> 9 

Met Ala lie Leu Asp Ser Gly Gly Val Ala Val Pro Pro Thr Glu Asn 
15 10 15 

Gly Val Ala Asp Leu Asp Arg Leu His Arg Arg Lys Ser Ser Ser Asp 
20 25 30 

Ser Ser Asn Gly Leu Leu Ser Asp Thr Ser Pro Ser Asp Asp Val Gly 
35 40 45 

Ala Ala Ala Ala Glu Arg Asp Arg Val Asp Ser Ala Ala Glu Glu Glu 
50 55 60 

Ala Gin Gly Thr Ala Asn Leu Ala Gly Gly Asp Ala Glu Thr Arg Glu 
65 70 75 80 

Ser Ala Gly Gly Asp Val Arg Phe Thr Tyr Arg Pro Ser Val Pro Ala 
85 90 95 

His Arg Arg Thr Arg Glu Ser Pro Leu Ser Ser Asp Ala lie Phe Lys 
100 105 110 

Gin Ser His Ala Gly Leu Phe Asn Leu Cvs Val Val Val Leu Val Ala 
115 120 " 125 

Val Asn Ser Arg Leu lie He Glu Asn Leu Met Lys Tyr Gly Trp Leu 
130 135 140 

He Arg Thr Asp Phe Trp Phe Ser Ser Thr Ser Leu Arg Asp Trp Pro 
145 150 155 160 

Leu Phe Met Cys Cys Leu Ser Leu Ser Val Phe Pro Leu Ala Ala Phe 
165 170 175 

Thr Val Glu Lys Met Val Leu Gin Lys Phe He Ser Glu Pro Val Ala 
180 185 190 

He He Leu His Val He lie Thr Met Thr Glu Val Leu Tyr Pro Val 
195 200 205 

Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr Leu 
210 215 220 
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Met Leu Leu Thr Cys He Val Trp Leu Lys Leu Val Ser Tyr Ala His 
225 230 235 240 

Thr Ser Tyr Asp He Arg Thr Leu Ala Asn Ser Ala Asp Lys Val Asp 
245 250 255 

Pro Glu He Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met 
260 265 270 

Val Ala Pro Thr Leu Cys Tyr Gin Pro Ser Tyr Pro Arg Ser Pro Cys 
275 280 285 

He Arg Lys Gly Trp Val Ala Arg Gin Leu Ala Lys Leu Val He Phe 
290 295 300 

Thr Gly Leu Met Gly Phe lie He Glu Gin Tyr He Asn Pro He Val 
305 310 315 320 

Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala He Glu 
325 330 335 

Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met 
340 345 350 

Phe Tyr Cys Phe Phe His Leu Trp Leu Asn He Leu Ala Glu Leu Leu 
355 360 365 

Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser 
370 375 380 

Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met 
385 390 395 400 

Val Arg His Val Tyr Phe Pro Cys Leu Arg He Lys He Pro Lys Val 
405 410 415 

Pro Ala He He He Ala Phe Leu Val Ser Ala Val Phe His Glu Leu 
420 425 430 

Cys He Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly 
435 440 445 

He Met Phe Gin Val Pro Leu Val Phe He Thr Asn Phe Leu Gin Glu 
450 455 460 

Arg Phe Gly Ser Met Val Gly Asn Met He Phe Gly Ser Ala Ser Cys 
465 470 475 480 

He Phe Gly Gin Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met 
485 490 495 

Asn Arg Lys Gly Ser Met Ser 
500 
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<2io> 10 

<211> 11 
<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> VARIANT 
<222> (1) 

<220> 

<221> VARIANT 
<222> (3) 

<220> 

<221> VARIANT 
<222> (5).. (6) 

<220> 

<221> VARIANT 
<222> (8) . . (10) 

<400> 10 

Xaa Leu Xaa Lys Xaa Xaa Ser Xaa Xaa Xaa Val 
15 10 


<210> 11 
<211> 27 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 11 

Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu lie Ala Val 
1 5 10 . 15 

Asn Ser Arg Leu lie lie Glu Asn Leu Met Lys 
20 25 


<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer of 
DGATXbal 

<400> 12 

ctagtctaga atggcgattt tgga 24 


<210> 13 
<211> 23 


SUBSTITUTE SHEET (RULE 20) 


WO 00/361 14 


PCT/CA99/01202 


14/20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer of 
DGATXhol 

<400> 13 

gcgctcgagt ttcatgacat cga 


<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT1 
<400> 14 

agacacgaat cccattccca ccga 24 


<210> 15 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT2 
<400> 15 

agtggtgaca acgcagggat gatg 


<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT3 
<400> 16 

atggtcgctc ccacattgtg t 21 


<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer DGAT4 
<400> 17 


SUBSTITUTE SHEET (RULE 2$) 


WO 00/361 14 


15/20 

catacaatcc ccatgacatt tatca 


<210> 18 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer A 
<400> 18 

cgaccgtcgg ttccagctca tcgg 


<210> 19 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer B 
<400> 19 

gcggccaatc tcgcagcgat cttg 


<210> 20 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer C 
<400> 20 

taaacagtag actcatcatc g 

<210> 21 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer Gen 
<400> 21 

gagaggatcc acgctcacga cccattcttc ccg 

<210> 22 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Primer Gen 2 
<400> 22 

aagaaggatc catccccaaa acgggaccac caa 33 


<210> 23 
<2X1> 19B5 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 23 

atttcttagc ttcttccttc aatccgctct 
tcaatttctt ctgcatgctt ctcgattctc 
cgtcaaacgc ttttcgaaat ggcgattttg 
aacggtggcg gagagttcgt cgatcttgat 
tcttctaacg gacttcttct ctctggttcc 
gctcccgccg acgttaggga tcggattgat 
gccaatttgg ccggagataa taacggtggt 
ggagaaggaa gaggaaacgc cgatgctacg 
cggagggcga gagagagtcc acttagctcc 
ttattcaacc tctgtgtagt agttcttatt 
cttatgaaga gccatgccgg attattcaac 
agtagactca tcatcgaaaa tcttatgaag 
tttagttcaa gatcgctgcg agattggccg 
tttcctttgg ctgcctttac ggttgagaaa 
gttgtcatct ttcttcatat tattatcacc 
accctaaggt gtgattctgc ttttttatca 
gtgtggctaa agttggtttc ttatgctcat 
gcagctgata aggccaatcc tgaagtctcc 
ttcatggtcg ctcccacatt gtgttatcag 
aagggttggg tggctcgtca atttgcaaaa 
ataatagaac aatatataaa tcctattgtc 
cttctatatg ctattgaaag agtgttgaag 
tgcatgttct actgcttctt ccacctttgg 
ggggatcgtg aattctacaa agattggtgg 
atgtggaata tgcctgttca taaatggatg 
agcaagatac caaagacact cgccattatc 
gagctatgca tcgcagttcc ttgtcgtctc 
tttcaggtgc ctttggtctt catcacaaac 
gggaacatga tcttctggtt catcttctgc 
tattaccacg acctgatgaa ccgaaaagga 
gactttcttc aaacatctat ggcctcgttg 
gctaaaacga caaatagtgt tataaccatt 
tctgcaaaaa ttttggtaga gacacgcaaa 
gccgc 


ttccctctcc attagattct gtttcctctt 60 
tctgacgcct cttttctccc gacgctgttt 120 
gattctgctg gcgttactac ggtgacggag 180 
aggcttcgtc gacggaaatc gagatcggat 240 
gataataatt ctccttcgga tgatgttgga 300 
tccgttgtta acgatgacgc tcagggaaca 360 
ggcgataata acggtggtgg aagaggcggc 420 
tttacgtatc gaccgtcggt tccagctcat 480 
gacgcaatct tcaaacagag ccatgccgga 540 
gctgtaaaca gtagactcat catcgaaaat 600 
ctctgtgtag tagttcttat tgctgtaaac 660 
tatggttggt tgatcagaac ggatttctgg 720 
cttttcatgt gttgtatatc cctttcgatc 780 
ttggtacttc agaaatacat atcagaacct 840 
atgacagagg ttttgtatcc agtttacgtc 900 
ggtgtcactt tgatgctcct cacttgcatt 960 
actagctatg acataagatc cctagccaat 1020 
tactacgtta gcttgaagag cttggcatat 1080 
ccaagttatc cacgttctgc atgtatacgg 1140 
ctggtcatat tcaccggatt catgggattt 1200 
aggaactcaa agcatccttt gaaaggcgat 1260 
ctttcagttc caaatttata tgtgtggctc 1320 
ttaaacatat tggcagagct tctctgcttc 1380 
aatgcaaaaa gtgtgggaga ttactggaga 1440 
gttcgacata tatacttccc gtgcttgcgc 1500 
attgctttcc tagtctctgc agtctttcat 1560 
ttcaagctat, gggcttttct tgggattatg 1620 
tatctacagg aaaggtttgg ctcaacggtg 1680 
attttcggac aaccgatgtg tgtgcttctt 1740 
tcgatgtcat gaaacaactg ttcaaaaaat 1800 
gatctccgtt gatgttgtgg tggttctgat 1860 
gaagaagaaa agaaaattag agttgttgta 1920 
cccgtttgga ttttgttatg gtgtaaagcg 1980 

1985 


<210> 24 
<211> 5339 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 24 

gctcacgacc cattcttccc gttccatttg gttttattta tttcaaagtt taatattcct 60 
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tttgtataac attcaaatct tcacatgatt gattgtgtga aaaccccaca gattttacta 120 
caataggggg agttgactta aaatagctat tgatgtcgaa aaaatgtatt ttagttataa 180 
attatactaa agaaaatttt tgatttgtct gttgtttaag catatgtatt gttaaactta 240 
aaaaaatatg tattgttaat cttaaaaatg taggagtaca catcaaatac tcgagcataa 300 
tcaaaaccgt attcatagac cgatgtgaga atcaaataga agataatgtg attttttaaa 360 
atatcgtatc tccaaatcaa tcacttagaa gataatgtaa ttctttatgt gctacataaa 420 
taaatatata tatatatata tatatatatc ttgtatatat gtcttgacaa aaaattgcca 480 
gtcaaaaacc atgactgaat caaactataa gtcggattga atcaaactat aagtcggatg 540 
agtattaatt tccattatgt ttctatactt tacaaaccgg aaaatagata ttatagatac 600 
caaaaaagta gatttgtgta tattattaga agatttggaa tttcatcatt atcaggatct 660 
aaagtacttc cctaattaaa tcatgtcggt tgaaaaagct caatgaatgt ttgaaatttg 720 
gaaagtttat taaattcgga tctttttttt ttgtttgtcg tcccaaacat ttttatttta 780 
ttacaaataa tcaacttatc cttactacta aatcatttca tatctttgat accaacaaat 840 
catttcatat tctattttga tgtttaagaa aacactattt accagttaca aaatattata 900 
aggattgttg tttagaaaaa aaagtacaag ttgaattctt tttgtcaaat ataaaattga 960 
ctttttaata tataattgac ttattgaaca tgattacaga attaatcatc tacaaaactt 1020 
tccaagttta taataaatac atttcaaaga ctattagttc ttcttaaaat atttctaaaa 1080 
gtgatcaaag actaccacat ataattcaga aaaagtagaa gttgatttct ttttgtcaaa 1140 
taaataattg acttaaaata gtttggaaag ccattgaact tgattataga attgataatg 1200 
tacataaaaa aattccaagt ttataataaa tacatttttc aaatgctata tcagttcttc 1260 
ttaaaatatt tcactaaaaa aacactcaaa tatagaataa atttattgaa taacatacca 1320 
actgtaaaac agaatttgac aaaaaaaaaa aaaaaatgaa atgaagatga agacaaaaat 1380 
aaatcaccag aggatcttat gcaaaaaaat atatgaatac acaataaacc atattgatat 1440 
ttttaaaata aaataaaaac agaaaaatat cccaacaccg cttttcaatt aaaaatcttc 1500 
cgtcaccatt gttgtcatct tcctctctcg tgaatccttt ttcctttctt cttcttcttc 1560 
tcttcagaga aaactttgct tctctttcta taaggaacca gacacgaatc ccattcccac 1620 
cgatttctta gcttcttcct tcaatccgct ctttccctct ccattagatt ctgtttcctc 1680 
tttcaatttc ttctgcatgc ttctcgattc tctctgacgc ctcttttctc ccgacgctgt 1740 
ttcgtcaaac gcttttcgaa atggcgattt tggattctgc tggcgttact acggtgacgg 1800 
agaacggtgg cggagagttc gtcgatcttg ataggcttcg tcgacggaaa tcgagatcgg 1860 
attcttctaa cggacttctt ctctctggtt ccgataataa ttctccttcg gatgatgttg 1920 
gagctcccgc cgacgttagg gatcggattg attccgttgt taacgatgac gctcagggaa 1980 
cagccaattt ggccggagat aataacggtg gtggcgataa taacggtggt ggaagaggcg 2040 
gcggagaagg aagaggaaac gccgatgcta cgtttacgta tcgaccgtcg gttccagctc 2100 
atcggagggc gagagagagt ccacttagct ccgacgcaat cttcaaacag gtttaaaatc 2160 
tcagaaatct tcgaatttgg tgtttgcttg ttgttttata tggaattgag tttggtgatt 2220 
gttttgcatt gcagagccat gccggattat tcaacctctg tgtagtagtt cttattgctg 2280 
taaacagtag actcatcatc gaaaatctta tgaaggtttg ctgttacttg tttctccttt 2340 
taggaattga attgcttgaa aatttatcat tgcattgcag agccatgccg gattattcaa 2400 
cctctgtgta gtagttctta ttgctgtaaa cagtagactc atcatcgaaa atcttatgaa 2460 
ggtttgctgt tacttgtttc tccttttagg aattgaattg cttgaaaatt tatcagagac 2520 
gaataacttt gttgttgcta tcattcatgt agtatggttg gttgatcaga acggatttct 2580 
ggtttagttc aagatcgctg cgagattggc cgcttttcat gtgttggtaa aagaagatgt 2640 
tttttatttc cagcaatgtt acattgttat acgtataatg atgagtttag tgatcaagtt 2700 
cctctttgat tcttctttct tgttgcagta tatccctttc gatctttcct ttggctgcct 2760 
ttacggttga gaaattggta cttcagaaat acatatcaga acctgtgagt aattactatt 2820 
ctccagccat tactgtaatt tttattgaag acaagtttgt atcatgaaga acttacaagt 2880 
tctgttttga aaatgctcaa ggttgtcatc tttcttcata ttattatcac catgacagag 2940 
gttttgtatc cagtttacgt caccctaagg tgatactgtt tttctggtct cagtttgtga 3000 
tactgttttt aagtttagtt gtctgacccg gtgatcttga aaatggacag gtgtgattct 3060 
gcttttttat caggtgtcac tttgatgctc ctcacttgca ttgtgtggct aaagttggtt 3120 
tcttatgctc atactagcta tgacataaga tccctagcca atgcagctga taaggtaaaa 3180 
tacgaaaaag aagcgtatgt attagtcact tgcactgtgt tactgtttta accaaacact 3240 
gttatgaact ttaggccaat cctgaagtct cctactacgt tagcttgaag agcttggcat 3300 
atttcatggt cgctcccaca ttgtgttatc aggtaactgc aaagtgcatc aaccattctt 3360 
atacttgcaa gagtttcttg tctaaacctc ggatctttgc ttttccccag ccaagttatc 3420 
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cacgttctgc atgtatacgg aagggttggg tggctcgtca atttgcaaaa ctggtcatat 3480 
tcaccggatt catgggattt ataatagaac aagtacgttt tcacatcttg ctttattagt 3540 
tttccttggt gaaaatcatc atccctgcgt tgtcaccact tgacttcatg ttcttttgtt 3600 
acattttggc agtatataaa tcctattgtc aggaactcaa agcatccttt gaaaggcgat 3660 
cttctatatg ctattgaaag agtgttgaag ctttcagttc caaatttata tgtgtggctc 3720 
tgcatgttct actgcttctt ccacctttgg tatgctgtga tcccatctct ttcaaaataa 3780 
tttgcaaatt cgaaaaaccg aaaaaggcta aatctcatac gaatttgata tttttagttt 3840 
cttagagtcg gtgatgtaat ttcagttact gaacgcaaat ctcttgtcca aaggttaaac 3900 
atattggcag agcttctctg cttcggggat cgtgaattct acaaagattg gtggaatgca 3960 
aaaagtgtgg gagatgtgag ctattttact caaaagaaaa cttatgattt ttaatgttgt 4020 
cgttgttttt gggtcatcta actaaccaaa ttcatgtatt cactgtcttc ctttatcagt 4080 
actggagaat gtggaatatg gtatggtfcct cttcctaaac atcaccttct tttgtacaca 4140 
aaatagaaga agagagctaa ttaagatctt gttttccttg acagcctgtt cataaatgga 4200 
tggttcgaca tatatacttc ccgtgcttgc gcagcaagat accaaaggtg agtgagatat 4260 
ataccgatat gcaattgtcg agatttgttt ctgtgatata aatttaaccc tccacacact 4320 
tgtttttcag acactcgcca ttatcattgc tttcctagtc tctgcagtct ttcatgaggt 4380 
atacatactt tctacattgc cctgtctcta gacgcatgaa cacacgctag tgaaagaaat 4440 
gctaatattc aaagcattgt ttttacttaa cgatcttgtg ttacaaattt ccttttgaca 4500 
gctatgcatc gcagttcctt gtcgtctctt caagctatgg gcttttcttg ggattatgtt 4560 
tcaggttaaa aaattactaa actgctgcag tcgattttta ctaaactcta atctcatatt 4620 
ctgaccaacc aatttgtttg agtaggtgcc tttggtcttc atcacaaact atctacagga 4680 
aaggtttggc tcaacggtat gctctcaaaa cccgagaaaa tagaacgaat aactctttct 4740 
ttcatagcct agccatttaa atcgcaatgc tgaaacttaa taataaaggt gatctgtttt 4800 
ggaatgggat catattatta ggtggggaac atgatcttct ggttcatctt ctgcattttc 4860 
ggacaaccga tgtgtgtgct tctttattac cacgacctga tgaaccgaaa aggatcgatg 4920 
tcatgaaaca actgttcaaa aaatgacttt cttcaaacat ctatggcctc gttggatctc 4980 
cgttgatgtt gtggtggttc tgatgctaaa acgacaaata gtgttataac cattgaagaa 5040 
gaaaagaaaa ttagagttgt tgtatctgca aaaattttgg tagagacacg cgaacccgtt 5100 
tggattttgt tatggtgtaa agaaatttca atcaaaaaac tgttgtaata attgttacca 5160 
aaaagaaatg cttttctgga aacgagggga aaaatagtag ttttgttagg ttttactgtt 5220 
tggaccaaat ctagtaaaaa actttttgta ataaggaaaa aaaaagaaca aatgtgataa 5280 
atgcatgggg attgtatgaa accttccaat aaagttgatt ggtggtcccg ttttgggga 5339 


<210> 25 
<211> 547 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 25 

Met Ala lie Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly 
1 5 10 15 

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 
20 25 30 

Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 
35 40 45 

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg lie Asp 
50 55 60 

Ser Val Val Asn Asp Asp Ala Gin Gly Thr Ala Asn Leu Ala Gly Asp 
65 70 75 80 

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 
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85 


90 


95 


Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 
100 105 110 

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala lie Phe 
115 120 125 

Lys Gin Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu lie 
130 135 140 

Ala Val Asn Ser Arg Leu He He Glu Asn Leu Met Lys Ser His Ala 
145 150 155 160 

Gly Leu Phe Asn Leu Cys Val Val Val Leu lie Ala Val Asn Ser Arg 
165 170 175 

Leu He He Glu Asn Leu Met Lys Tyr Gly Trp Leu He Arg Thr Asp 
180 185 190 

Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp Pro Leu Phe Met Cys 
195 200 205 

Cys He Ser Leu Ser He Phe Pro Leu Ala Ala Phe Thr Val Glu Lys 
210 215 220 

Leu Val Leu Gin Lys Tyr He Ser Glu Pro Val Val He Phe Leu His 
225 230 235 240 

He He He Thr Met Thr Glu Val Leu Tyr Pro Val Tyr Val Thr Leu 
245 250 255 

Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr Leu Met Leu Leu Thr 
260 265 270 

Cys He Val Trp Leu Lys Leu Val Ser Tyr Ala His Thr Ser Tyr Asp 
275 280 285 

He Arg Ser Leu Ala Asn Ala Ala Asp. Lys Ala Asn Pro Glu Val Ser 
290 295 300 

Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met Val Ala Pro Thr 
305 310 315 320 

Leu Cys Tyr Gin Pro Ser Tyr Pro Arg Ser Ala Cys He Arg Lys Gly 
325 330 335 

Trp Val Ala Arg Gin Phe Ala Lys Leu Val He Phe Thr Gly Phe Met 
340 345 350 

Gly Phe He He Glu Gin Tyr He Asn Pro He Val Arg Asn Ser Lys 
355 360 365 

His Pro Leu Lys Gly Asp Leu Leu Tyr Ala He Glu Arg Val Leu Lys 


370 


375 


380 
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Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr Cys Phe 
385 390 395 400 

Phe His Leu Trp Leu Asn lie Leu Ala Glu Leu Leu Cys Phe Gly Asp 
405 410 415 

Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser Val Gly Asp Tyr 
420 425 430 

Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg His lie 
435 440 445 

Tyr Phe Pro Cys Leu Arg Ser Lys lie Pro Lys Thr Leu Ala lie He 
450 455 460 

He Ala Phe Leu Val Ser Ala Val Phe His Glu Leu Cys He Ala Val 
465 470 475 480 

Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu Gly lie Met Phe Gin 
485 490 495 

Val Pro Leu Val Phe He Thr Asn Tyr Leu Gin Glu Arg Phe Gly Ser 
500 505 510 

Thr Val Gly Asn Met He Phe Trp Phe He Phe Cys He Phe Gly Gin 
515 520 525 

Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu Met Asn Arg Lys Gly 
530 535 540 

Ser Met Ser 
545 
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